Multi-Modal Learning
1,215 papers
Papers per year
2
1
1
2
5
5
1
5
8
21
42
42
69
72
149
143
258
371
18
'15
'20
'25
Papers
Distilling a Pretrained Language Model to a Multilingual ASR Model
INTERSPEECH 2022
Word Discovery in Visually Grounded, Self-Supervised Speech Models
INTERSPEECH 2022
ASR2K: Speech Recognition for Around 2000 Languages without Audio
INTERSPEECH 2022
Guiding Visual Question Generation
NAACL 2022