Papers
Towards End-to-End Unified Recognition for Mandarin and Cantonese
INTERSPEECH 2024
Speech Recognition for Greek Dialects: A Challenging Benchmark
INTERSPEECH 2024
Towards Multilingual Audio-Visual Question Answering
INTERSPEECH 2024
XTTS: a Massively Multilingual Zero-Shot Text-to-Speech Model
INTERSPEECH 2024