Papers
Learning Speech Models from Multi-Modal Data
INTERSPEECH 2021
Temporal Context in Speech Emotion Recognition
INTERSPEECH 2021
AvaTr: One-Shot Speaker Extraction with Transformers
INTERSPEECH 2021
Spoken ObjectNet: A Bias-Controlled Spoken Caption Dataset
INTERSPEECH 2021
Towards Automatic Speech to Sign Language Generation
INTERSPEECH 2021
Speech Emotion Recognition via Multi-Level Cross-Modal Distillation
INTERSPEECH 2021