Papers

8,761 papers found
Optimizing Large-Scale Context Retrieval for End-to-End ASR
Zhiqi Huang, Diamantino Caseiro, Kandarp Joshi et al.
2024 INTERSPEECH
Out-of-distribution generalisation in spoken language understanding
Dejan Porjazovski, Anssi Moisio, Mikko Kurimo
2024 INTERSPEECH
PAM: Prompting Audio-Language Models for Audio Quality Assessment
Soham Deshmukh, Dareen Alharthi, Benjamin Elizalde et al.
2024 INTERSPEECH
PARIS: Pseudo-AutoRegressIve Siamese Training for Online Speech Separation
Zexu Pan, Gordon Wichern, François G. Germain et al.
2024 INTERSPEECH
Perceptual Learning in Lexical Tone: Phonetic Similarity vs. Phonological Categories
Ariëlle Reitsema, Chenxin Li, Leanne van Lambalgen et al.
2024 INTERSPEECH
Performant ASR Models for Medical Entities in Accented Speech
Tejumade Afonja, Tobi Olatunji, Sewade Ogun et al.
2024 INTERSPEECH
PERSONA: an application for emotion recognition, gender recognition and age estimation
Devyani Koshal, Orchid Chetia Phukan, Sarthak Jain et al.
2024 INTERSPEECH
Phoneme Discretized Saliency Maps for Explainable Detection of AI-Generated Voice
Shubham Gupta, Mirco Ravanelli, Pascal Germain et al.
2024 INTERSPEECH