Multimodal Learning
13,185 papers
Papers per year (chart omitted; x-axis ticks run from '10 to '25)
Papers
PAM: Prompting Audio-Language Models for Audio Quality Assessment
INTERSPEECH 2024
AVR: synergizing foundation models for audio-visual humor detection
INTERSPEECH 2024
NeuRO: an application for code-switched autism detection in children
INTERSPEECH 2024
Do Speaker-dependent Vowel Characteristics depend on Speech Style?
INTERSPEECH 2024
Prosodic marking of syntactic boundaries in Khoekhoe
INTERSPEECH 2024
Affricates in Lushootseed
INTERSPEECH 2024
A Transformer-Based Voice Activity Detector
INTERSPEECH 2024
Towards Multilingual Audio-Visual Question Answering
INTERSPEECH 2024
On the Effectiveness of Acoustic BPE in Decoder-Only TTS
INTERSPEECH 2024