speech recognition

1223 papers

Explore in graph

Also known as

STT WER HSR SRS ASR SR

Co-occurring keywords

automatic speech recognition (1764) word error rate (406) acoustic model (277) speech translation (413) multimodal learning (4622) language model (4573) self-supervised learning (3751) machine translation (2472) deep neural network (1801) neural network (6616)

Papers

A Theory of Unsupervised Speech Recognition ACL 2023

PATCorrect: Non-autoregressive Phoneme-augmented Transformer for ASR Error Correction INTERSPEECH 2023

Joint Speech Transcription and Translation: Pseudo-Labeling with Out-of-Distribution Data ACL 2023

GrounDialog: A Dataset for Repair and Grounding in Task-oriented Spoken Dialogues for Language Learning ACL 2023

JHU IWSLT 2023 Multilingual Speech Translation System Description ACL 2023

Fine-tuning mSLAM for the SIGMORPHON 2022 Shared Task on Grapheme-to-Phoneme Conversion ACL 2023

Prompting the Hidden Talent of Web-Scale Speech Models for Zero-Shot Task Generalization INTERSPEECH 2023

N-best T5: Robust ASR Error Correction using Multiple Input Hypotheses and Constrained Decoding Space INTERSPEECH 2023

Toward Interactive Dictation ACL 2023

What Can an Accent Identifier Learn? Probing Phonetic and Prosodic Information in a Wav2vec2-based Accent Identification Model INTERSPEECH 2023

WhiSLU: End-to-End Spoken Language Understanding with Whisper INTERSPEECH 2023

Transformer-based Speech Recognition Models for Oral History Archives in English, German, and Czech INTERSPEECH 2023

Matesub: The Translated Subtitling Tool at the IWSLT2023 Subtitling Task ACL 2023

Yet Another Model for Arabic Dialect Identification EMNLP 2023

Towards Multi-task Learning of Speech and Speaker Recognition INTERSPEECH 2023

Speech Aware Dialog System Technology Challenge (DSTC11) INTERSPEECH 2023

Discrimination of the Different Intents Carried by the Same Text Through Integrating Multimodal Information INTERSPEECH 2023

On-the-Fly Feature Based Rapid Speaker Adaptation for Dysarthric and Elderly Speech Recognition INTERSPEECH 2023

Target Vocabulary Recognition Based on Multi-Task Learning with Decomposed Teacher Sequences INTERSPEECH 2023

Back Translation for Speech-to-text Translation Without Transcripts ACL 2023

A Personalised Speech Communication Application for Dysarthric Speakers INTERSPEECH 2023

Improved DeepFake Detection Using Whisper Features INTERSPEECH 2023

Hierarchical Fusion for Online Multimodal Dialog Act Classification EMNLP 2023

The BIGAI Offline Speech Translation Systems for IWSLT 2023 Evaluation ACL 2023

Adaptive Contextual Biasing for Transducer Based Streaming Speech Recognition INTERSPEECH 2023