Papers
PATCorrect: Non-autoregressive Phoneme-augmented Transformer for ASR Error Correction
INTERSPEECH 2023
Prompting the Hidden Talent of Web-Scale Speech Models for Zero-Shot Task Generalization
INTERSPEECH 2023
N-best T5: Robust ASR Error Correction using Multiple Input Hypotheses and Constrained Decoding Space
INTERSPEECH 2023
Toward Interactive Dictation
ACL 2023
What Can an Accent Identifier Learn? Probing Phonetic and Prosodic Information in a Wav2vec2-based Accent Identification Model
INTERSPEECH 2023
WhiSLU: End-to-End Spoken Language Understanding with Whisper
INTERSPEECH 2023
Transformer-based Speech Recognition Models for Oral History Archives in English, German, and Czech
INTERSPEECH 2023
Towards Multi-task Learning of Speech and Speaker Recognition
INTERSPEECH 2023
Speech Aware Dialog System Technology Challenge (DSTC11)
INTERSPEECH 2023
Discrimination of the Different Intents Carried by the Same Text Through Integrating Multimodal Information
INTERSPEECH 2023
On-the-Fly Feature Based Rapid Speaker Adaptation for Dysarthric and Elderly Speech Recognition
INTERSPEECH 2023
Target Vocabulary Recognition Based on Multi-Task Learning with Decomposed Teacher Sequences
INTERSPEECH 2023
Improved DeepFake Detection Using Whisper Features
INTERSPEECH 2023