Papers

8,761 papers found
GSQA: An End-to-End Model for Generative Spoken Question Answering
Min-Han Shih, Ho-Lam Chung, Yu-Chi Pai et al.
2024 INTERSPEECH
GTR-Voice: Articulatory Phonetics Informed Controllable Expressive Speech Synthesis
Zehua Kcriss Li, Meiying Melissa Chen, Yi Zhong et al.
2024 INTERSPEECH
Harder or Different? Understanding Generalization of Audio Deepfake Detection
Nicolas M. Müller, Nicholas Evans, Hemlata Tak et al.
2024 INTERSPEECH
Hear Your Face: Face-based voice conversion with F0 estimation
Jaejun Lee, Yoori Oh, Injune Hwang et al.
2024 INTERSPEECH
HebDB: a Weakly Supervised Dataset for Hebrew Speech Processing
Arnon Turetzky, Or Tal, Yael Segal et al.
2024 INTERSPEECH
Hierarchical Multi-Task Learning with CTC and Recursive Operation
Nahomi Kusunoki, Yosuke Higuchi, Tetsuji Ogawa et al.
2024 INTERSPEECH
Hold Me Tight: Stable Encoder-Decoder Design for Speech Enhancement
Daniel Haider, Felix Perfler, Vincent Lostanlen et al.
2024 INTERSPEECH
Homograph Disambiguation with Text-to-Text Transfer Transformer
Markéta Řezáčková, Daniel Tihelka, Jindřich Matoušek
2024 INTERSPEECH
How Do Neural Spoofing Countermeasures Detect Partially Spoofed Audio?
Tianchi Liu, Lin Zhang, Rohan Kumar Das et al.
2024 INTERSPEECH
How Should We Extract Discrete Audio Tokens from Self-Supervised Models?
Pooneh Mousavi, Jarod Duret, Salah Zaiem et al.
2024 INTERSPEECH
Hybrid-Diarization System with Overlap Post-Processing for the DISPLACE 2024 Challenge
Gabriel Pîrlogeanu, Octavian Pascu, Alexandru-Lucian Georgescu et al.
2024 INTERSPEECH