Multimodal Learning
13,185 papers
Papers per year
1
3
6
2
5
2
3
6
24
20
46
109
205
299
622
675
987
1084
1697
2500
3655
1234
'10
'15
'20
'25
Papers
Towards Cross-Language Prosody Transfer for Dialog
INTERSPEECH 2023
GigaST: A 10,000-hour Pseudo Speech Translation Corpus
INTERSPEECH 2023
Parsing dialog turns with prosodic features in English
INTERSPEECH 2023
Abusive Speech Detection in Indic Languages Using Acoustic Features
INTERSPEECH 2023
An Improved End-to-End Audio-Visual Speech Recognition Model
INTERSPEECH 2023
Reversible Neural Networks for Memory-Efficient Speaker Verification
INTERSPEECH 2023
Instance-based Temporal Normalization for Speaker Verification
INTERSPEECH 2023