Multimodal Learning
13,185 papers
Papers per year
[Bar chart; x-axis ticks: '10, '15, '20, '25. Yearly counts in chart order: 1, 3, 6, 2, 5, 2, 3, 6, 24, 20, 46, 109, 205, 299, 622, 675, 987, 1084, 1697, 2500, 3655, 1234]
Papers
GAN-Based Data Generation for Speech Emotion Recognition
INTERSPEECH 2020
FaceFilter: Audio-Visual Speech Separation Using Still Images
INTERSPEECH 2020
Caption Alignment for Low Resource Audio-Visual Data
INTERSPEECH 2020
Phonetic Entrainment in Cooperative Dialogues: A Case of Russian
INTERSPEECH 2020