Co-occurring keywords
Papers
Text-Only Domain Adaptation Based on Intermediate CTC
INTERSPEECH 2022
End-to-end Speech-to-Punctuated-Text Recognition
INTERSPEECH 2022
Minimum latency training of sequence transducers for streaming end-to-end speech recognition
INTERSPEECH 2022
DAVIS: Driver’s Audio-Visual Speech recognition
INTERSPEECH 2022
Towards Efficiently Learning Monotonic Alignments for Attention-based End-to-End Speech Recognition
INTERSPEECH 2022
Domain Prompts: Towards memory and compute efficient domain adaptation of ASR systems
INTERSPEECH 2022
Detecting Dysfluencies in Stuttering Therapy Using wav2vec 2.0
INTERSPEECH 2022
Extending RNN-T-based speech recognition systems with emotion and language classification
INTERSPEECH 2022
An Anchor-Free Detector for Continuous Speech Keyword Spotting
INTERSPEECH 2022
Accelerating Inference and Language Model Fusion of Recurrent Neural Network Transducers via End-to-End 4-bit Quantization
INTERSPEECH 2022
Transfer Learning from Multi-Lingual Speech Translation Benefits Low-Resource Speech Recognition
INTERSPEECH 2022
Internal Language Model Estimation Through Explicit Context Vector Learning for Attention-based Encoder-decoder ASR
INTERSPEECH 2022
Sentence-Select: Large-Scale Language Model Data Selection for Rare-Word Speech Recognition
INTERSPEECH 2022
End-to-End Dependency Parsing of Spoken French
INTERSPEECH 2022
Latency Control for Keyword Spotting
INTERSPEECH 2022
On the Prediction Network Architecture in RNN-T for ASR
INTERSPEECH 2022