multimodal learning
4622 papers
Also known as
VLM
VLLM
MM
VLA
MLLMS
MLM
MML
MULLM
LMM
MLLM
MMT
Papers
Interpretability of Speech Emotion Recognition modelled using Self-Supervised Speech and Text Pre-Trained Embeddings
INTERSPEECH 2022
Transformer-Based Video Front-Ends for Audio-Visual Speech Recognition for Single and Multi-Person Video
INTERSPEECH 2022
CNN-based Audio Event Recognition for Automated Violence Classification and Rating for Prime Video Content
INTERSPEECH 2022
End-to-End Audio-Visual Neural Speaker Diarization
INTERSPEECH 2022
PM2F2N: Patient Multi-view Multi-modal Feature Fusion Networks for Clinical Outcome Prediction
EMNLP 2022