Multimodal Learning
13,185 papers
Papers per year (bar chart; x-axis ticks '10, '15, '20, '25)
Counts in plotted order: 1, 3, 6, 2, 5, 2, 3, 6, 24, 20, 46, 109, 205, 299, 622, 675, 987, 1084, 1697, 2500, 3655, 1234
Papers
Text-aware Speech Separation for Multi-talker Keyword Spotting
INTERSPEECH 2024
Translating speech with just images
INTERSPEECH 2024
ZeroST: Zero-Shot Speech Translation
INTERSPEECH 2024
Towards EMG-to-Speech with Necklace Form Factor
INTERSPEECH 2024
Auditory Attention Decoding in Four-Talker Environment with EEG
INTERSPEECH 2024
Familiar and Unfamiliar Speaker Identification in Speech and Singing
INTERSPEECH 2024
Factor-Conditioned Speaking-Style Captioning
INTERSPEECH 2024
Contrastive Feedback Mechanism for Simultaneous Speech Translation
INTERSPEECH 2024
Stress transfer in speech-to-speech machine translation
INTERSPEECH 2024
Multimodal Belief Prediction
INTERSPEECH 2024