Multimodal Learning
13,185 papers
[Chart: Papers per year; year axis ticks '10, '15, '20, '25]
Papers
MMER: Multimodal Multi-task Learning for Speech Emotion Recognition
INTERSPEECH 2023
Delay-penalized CTC Implemented Based on Finite State Transducer
INTERSPEECH 2023
Improving Joint Speech-Text Representations Without Alignment
INTERSPEECH 2023
Query Based Acoustic Summarization for Podcasts
INTERSPEECH 2023
Turbo your multi-modal classification with contrastive learning
INTERSPEECH 2023
CFVC: Conditional Filtering for Controllable Voice Conversion
INTERSPEECH 2023
Cross-utterance Conditioned Coherent Speech Editing
INTERSPEECH 2023