Papers
Controlling Emotion in Text-to-Speech with Natural Language Prompts
INTERSPEECH 2024
FLEURS-R: A Restored Multilingual Speech Corpus for Generation Tasks
INTERSPEECH 2024
An End-to-End Approach for Chord-Conditioned Song Generation
INTERSPEECH 2024
MaskSR: Masked Language Model for Full-band Speech Restoration
INTERSPEECH 2024
MaLa-ASR: Multimedia-Assisted LLM-Based ASR
INTERSPEECH 2024
Dataset-Distillation Generative Model for Speech Emotion Recognition
INTERSPEECH 2024
FreeV: Free Lunch For Vocoders Through Pseudo Inversed Mel Filter
INTERSPEECH 2024
QGAN: Low Footprint Quaternion Neural Vocoder for Speech Synthesis
INTERSPEECH 2024