Multimodal Learning
13,057 papers
Papers per year
1
3
6
2
5
2
3
6
24
20
46
109
205
299
622
675
987
1084
1697
2500
3654
1107
'10
'15
'20
'25
Papers
Learning Audio-Text Agreement for Open-vocabulary Keyword Spotting
INTERSPEECH 2022
Application for Real-time Personalized Speaker Extraction
INTERSPEECH 2022
Context-aware Multimodal Fusion for Emotion Recognition
INTERSPEECH 2022
A Multimodal Strategy for Singing Language Identification
INTERSPEECH 2022
Comparison of Models for Detecting Off-Putting Speaking Styles
INTERSPEECH 2022
Visually-aware Acoustic Event Detection using Heterogeneous Graphs
INTERSPEECH 2022
On Breathing Pattern Information in Synthetic Speech
INTERSPEECH 2022
Attacker Attribution of Audio Deepfakes
INTERSPEECH 2022
Word Discovery in Visually Grounded, Self-Supervised Speech Models
INTERSPEECH 2022