speech recognition

1223 papers

Also known as

STT, WER, HSR, SRS, ASR, SR

Co-occurring keywords

automatic speech recognition (1764); word error rate (406); acoustic model (277); speech translation (413); multimodal learning (4622); language model (4573); self-supervised learning (3751); machine translation (2472); deep neural network (1801); neural network (6616)

Papers

Text-Only Domain Adaptation Based on Intermediate CTC INTERSPEECH 2022

End-to-end Speech-to-Punctuated-Text Recognition INTERSPEECH 2022

Minimum latency training of sequence transducers for streaming end-to-end speech recognition INTERSPEECH 2022

DAVIS: Driver’s Audio-Visual Speech recognition INTERSPEECH 2022

Towards Efficiently Learning Monotonic Alignments for Attention-based End-to-End Speech Recognition INTERSPEECH 2022

Domain Prompts: Towards memory and compute efficient domain adaptation of ASR systems INTERSPEECH 2022

Space-Efficient Representation of Entity-centric Query Language Models INTERSPEECH 2022

Detecting Dysfluencies in Stuttering Therapy Using wav2vec 2.0 INTERSPEECH 2022

Extending RNN-T-based speech recognition systems with emotion and language classification INTERSPEECH 2022

Interpretable dysarthric speaker adaptation based on optimal-transport INTERSPEECH 2022

An Anchor-Free Detector for Continuous Speech Keyword Spotting INTERSPEECH 2022

The HW-TSC’s Speech to Speech Translation System for IWSLT 2022 Evaluation ACL 2022

SpeechT5: Unified-Modal Encoder-Decoder Pre-Training for Spoken Language Processing ACL 2022

Self-supervised Semantic-driven Phoneme Discovery for Zero-resource Speech Recognition ACL 2022

FAtNet: Cost-Effective Approach Towards Mitigating the Linguistic Bias in Speaker Verification Systems NAACL 2022

Cue-bot: A Conversational Agent for Assistive Technology ACL 2022

Accelerating Inference and Language Model Fusion of Recurrent Neural Network Transducers via End-to-End 4-bit Quantization INTERSPEECH 2022

Transfer Learning from Multi-Lingual Speech Translation Benefits Low-Resource Speech Recognition INTERSPEECH 2022

Internal Language Model Estimation Through Explicit Context Vector Learning for Attention-based Encoder-decoder ASR INTERSPEECH 2022

Automatically Detecting Reduced-formed English Pronunciations by Using Deep Learning NAACL 2022

Sentence-Select: Large-Scale Language Model Data Selection for Rare-Word Speech Recognition INTERSPEECH 2022

End-to-End Dependency Parsing of Spoken French INTERSPEECH 2022

Latency Control for Keyword Spotting INTERSPEECH 2022

On the Prediction Network Architecture in RNN-T for ASR INTERSPEECH 2022

DEFORMER: Coupling Deformed Localized Patterns with Global Context for Robust End-to-end Speech Recognition INTERSPEECH 2022