multimodal learning
4622 papers
Also known as
VLM
VLLM
MM
VLA
MLLMS
MLM
MML
MULLM
LMM
MLLM
MMT
Co-occurring keywords
Papers
Speech and Text Analysis for Multimodal Addressee Detection in Human-Human-Computer Interaction
INTERSPEECH 2017
NTCD-TIMIT: A New Database and Baseline for Noise-Robust Audio-Visual Speech Recognition
INTERSPEECH 2017
Parameter estimation of Japanese predicate argument structure analysis model using eye gaze information
COLING 2016
Exploring Collections of Multimedia Archives Through Innovative Interfaces in the Context of Digital Humanities
INTERSPEECH 2016
Dynamic Stream Weighting for Turbo-Decoding-Based Audiovisual ASR
INTERSPEECH 2016
Introducing the Turbo-Twin-HMM for Audio-Visual Speech Enhancement
INTERSPEECH 2016
Combining CNN and BLSTM to Extract Textual and Acoustic Features for Recognizing Stances in Mandarin Ideological Debate Competition
INTERSPEECH 2016
Automatic Genre and Show Identification of Broadcast Media
INTERSPEECH 2016