Papers

8,761 papers found
Revisiting Convolution-free Transformer for Speech Recognition
Zejiang Hou, Goeric Huybrechts, Anshu Bhatia et al.
2024 INTERSPEECH
Revisiting Pitch Jumps: F0 Ratio in Seoul Korean
Michaela Watkins, Paul Boersma, Silke Hamann
2024 INTERSPEECH
RIR-in-a-Box: Estimating Room Acoustics from 3D Mesh Data through Shoebox Approximation
Liam Kelley, Diego Di Carlo, Aditya Arie Nugraha et al.
2024 INTERSPEECH
ROAR: Reinforcing Original to Augmented Data Ratio Dynamics for Wav2vec2.0 Based ASR
Vishwanath Pratap Singh, Federico Malato, Ville Hautamäki et al.
2024 INTERSPEECH
2024 INTERSPEECH
RT-LA-VocE: Real-Time Low-SNR Audio-Visual Speech Enhancement
Honglie Chen, Rodrigo Mira, Stavros Petridis et al.
2024 INTERSPEECH
2024 INTERSPEECH
SALSA: Speedy ASR-LLM Synchronous Aggregation
Ashish Mittal, Darshan Prabhu, Sunita Sarawagi et al.
2024 INTERSPEECH
SAML: Speaker Adaptive Mixture of LoRA Experts for End-to-End ASR
Qiuming Zhao, Guangzhi Sun, Chao Zhang et al.
2024 INTERSPEECH
Sample-Efficient Diffusion for Text-To-Speech Synthesis
Justin Lovelace, Soham Ray, Kwangyoun Kim et al.
2024 INTERSPEECH
SAMSEMO: New dataset for multilingual and multimodal emotion recognition
Pawel Bujnowski, Bartlomiej Kuzma, Bartlomiej Paziewski et al.
2024 INTERSPEECH
2024 INTERSPEECH
Scaling up masked audio encoder learning for general audio classification
Heinrich Dinkel, Zhiyong Yan, Yongqing Wang et al.
2024 INTERSPEECH
2024 INTERSPEECH
Schrödinger Bridge for Generative Speech Enhancement
Ante Jukić, Roman Korostik, Jagadeesh Balam et al.
2024 INTERSPEECH
2024 INTERSPEECH