Multimodal Learning
13,185 papers
Papers per year
1
3
6
2
5
2
3
6
24
20
46
109
205
299
622
675
987
1084
1697
2500
3655
1234
'10
'15
'20
'25
Papers
Singing Voice Graph Modeling for SingFake Detection
INTERSPEECH 2024
Towards realtime co-speech gestures synthesis using STARGATE
INTERSPEECH 2024
PPPR: Portable Plug-in Prompt Refiner for Text to Audio Generation
INTERSPEECH 2024