Papers

8,761 papers found
Motion Based Audio-Visual Segmentation
Jiahao Li, Miao Liu, Shu Yang et al.
2024 INTERSPEECH
Multi-Channel Extension of Pre-trained Models for Speaker Verification
Ladislav Mošner, Romain Serizel, Lukáš Burget et al.
2024 INTERSPEECH
Multi-Channel Multi-Speaker ASR Using Target Speaker’s Solo Segment
Yiwen Shao, Shi-Xiong Zhang, Yong Xu et al.
2024 INTERSPEECH
MULTI-CONVFORMER: Extending Conformer with Multiple Convolution Kernels
Darshan Prabhu, Yifan Peng, Preethi Jyothi et al.
2024 INTERSPEECH
Multi-modal Adversarial Training for Zero-Shot Voice Cloning
John Janiczek, Dading Chong, Dongyang Dai et al.
2024 INTERSPEECH
Multimodal Belief Prediction
John Murzaku, Adil Soubki, Owen Rambow
2024 INTERSPEECH
Multimodal Fusion for Vocal Biomarkers Using Vector Cross-Attention
Vladimir Despotovic, Abir Elbéji, Petr V. Nazarov et al.
2024 INTERSPEECH