multimodal learning
4622 papers
Also known as
VLM
VLLM
MM
VLA
MLLMS
MLM
MML
MULLM
LMM
MLLM
MMT
Co-occurring keywords
Papers
dutir914 at SemEval-2025 Task 1: An integrated approach for Multimodal Idiomaticity Representations
ACL 2025
Temporal Working Memory: Query-Guided Segment Refinement for Enhanced Multimodal Understanding
NAACL 2025
Multi-Condition Guided Diffusion Network for Multimodal Emotion Recognition in Conversation
NAACL 2025
Driving by the Rules: A Benchmark for Integrating Traffic Sign Regulations into Vectorized HD Map
CVPR 2025
Rethinking Vision-Language Model in Face Forensics: Multi-Modal Interpretable Forged Face Detector
CVPR 2025
IDEA: Inverted Text with Cooperative Deformable Aggregation for Multi-modal Object Re-Identification
CVPR 2025
Diving into Mitigating Hallucinations from a Vision Perspective for Large Vision-Language Models
EMNLP 2025
SimpleDoc: Multi‐Modal Document Understanding with Dual‐Cue Page Retrieval and Iterative Refinement
EMNLP 2025
FJWU_Squad at SemEval-2025 Task 1: An Idiom Visual Understanding Dataset for Idiom Learning
ACL 2025