multimodal learning
4622 papers
Also known as
VLM
VLLM
MM
VLA
MLLMS
MLM
MML
MULLM
LMM
MLLM
MMT
Co-occurring keywords
Papers
Spotting the Unexpected (STU): A 3D LiDAR Dataset for Anomaly Segmentation in Autonomous Driving
CVPR 2025
KEC_AI_ZEROWATTS@DravidianLangTech 2025: Multimodal Hate Speech Detection in Dravidian languages
NAACL 2025
Tri-Ergon: Fine-Grained Video-to-Audio Generation with Multi-Modal Conditions and LUFS Control
AAAI 2025
MammAlps: A Multi-view Video Behavior Monitoring Dataset of Wild Mammals in the Swiss Alps
CVPR 2025
Multimodal Fine-grained Context Interaction Graph Modeling for Conversational Speech Synthesis
EMNLP 2025
ViDove: A Translation Agent System with Multimodal Context and Memory-Augmented Reasoning
EMNLP 2025
TSAM: Temporal SAM Augmented with Multimodal Prompts for Referring Audio-Visual Segmentation
CVPR 2025