multimodal learning
4622 papers
Also known as
VLM
VLLM
MM
VLA
MLLMS
MLM
MML
MULLM
LMM
MLLM
MMT
Co-occurring keywords
Papers
Unified Coding for Both Human Perception and Generalized Machine Analytics with CLIP Supervision
AAAI 2025
Rethinking Vision-Language Model in Face Forensics: Multi-Modal Interpretable Forged Face Detector
CVPR 2025
Towards Video Thinking Test: A Holistic Benchmark for Advanced Video Reasoning and Understanding
ICCV 2025
Driving by the Rules: A Benchmark for Integrating Traffic Sign Regulations into Vectorized HD Map
CVPR 2025
RCTrans: Radar-Camera Transformer via Radar Densifier and Sequential Decoder for 3D Object Detection
AAAI 2025
IDEA: Inverted Text with Cooperative Deformable Aggregation for Multi-modal Object Re-Identification
CVPR 2025