multimodal learning
4622 papers
Also known as
VLM
VLLM
MM
VLA
MLLMS
MLM
MML
MULLM
LMM
MLLM
MMT
Papers
UniPose: A Unified Multimodal Framework for Human Pose Comprehension, Generation and Editing
CVPR 2025
SlideChat: A Large Vision-Language Assistant for Whole-Slide Pathology Image Understanding
CVPR 2025
Recoverable Compression: A Multimodal Vision Token Recovery Mechanism Guided by Text Information
AAAI 2025
A High-Quality Text-Rich Image Instruction Tuning Dataset via Hybrid Instruction Generation
COLING 2025
Referring to Any Person
ICCV 2025
OVEL: Online Video Entity Linking
COLING 2025