multimodal learning
4622 papers
Also known as
VLM
VLLM
MM
VLA
MLLMS
MLM
MML
MULLM
LMM
MLLM
MMT
Co-occurring keywords
Papers
Cross-modal Features Interaction-and-Aggregation Network with Self-consistency Training for Speech Emotion Recognition
INTERSPEECH 2024
NUS-Emo at SemEval-2024 Task 3: Instruction-Tuning LLM for Multimodal Emotion-Cause Analysis in Conversations
SEMEVAL 2024
LLM-Driven Multimodal Opinion Expression Identification
INTERSPEECH 2024
Contrastive Feedback Mechanism for Simultaneous Speech Translation
INTERSPEECH 2024
Spontaneous Speech-Based Suicide Risk Detection Using Whisper and Large Language Models
INTERSPEECH 2024
A multimodal analysis of different types of laughter expression in conversational dialogues
INTERSPEECH 2024
Enhancing Automated Audio Captioning via Large Language Models with Optimized Audio Encoding
INTERSPEECH 2024
An End-to-End Speech Summarization Using Large Language Model
INTERSPEECH 2024
Towards Intelligent Speech Assistants in Operating Rooms: A Multimodal Model for Surgical Workflow Analysis
INTERSPEECH 2024
Understanding Sounds, Missing the Questions: The Challenge of Object Hallucination in Large Audio-Language Models
INTERSPEECH 2024
VIEWS: Entity-Aware News Video Captioning
EMNLP 2024