Multimodal Learning
13,185 papers
Papers per year (chart; x-axis ticks '10, '15, '20, '25). Counts in chart order: 1, 3, 6, 2, 5, 2, 3, 6, 24, 20, 46, 109, 205, 299, 622, 675, 987, 1084, 1697, 2500, 3655, 1234
Papers
STraDa: A Singer Traits Dataset
INTERSPEECH 2024
Multimodal Fusion for Vocal Biomarkers Using Vector Cross-Attention
INTERSPEECH 2024
Domain Adaptation for Contrastive Audio-Language Models
INTERSPEECH 2024
An End-to-End Speech Summarization Using Large Language Model
INTERSPEECH 2024
Custom wake word detection
INTERSPEECH 2024
A demonstrator for articulation-based command word recognition
INTERSPEECH 2024