multimodal learning
4622 papers
Also known as
VLM
VLLM
MM
VLA
MLLMS
MLM
MML
MULLM
LMM
MLLM
MMT
Co-occurring keywords
Papers
CLEVR-POC: Reasoning-Intensive Visual Question Answering in Partially Observable Environments
COLING 2024
Towards Surveillance Video-and-Language Understanding: New Dataset Baselines and Challenges
CVPR 2024
Few-Shot Multimodal Named Entity Recognition Based on Mutlimodal Causal Intervention Graph
COLING 2024
MMAD:Multi-modal Movie Audio Description
COLING 2024
On the Robustness of Language Guidance for Low-Level Vision Tasks: Findings from Depth Estimation
CVPR 2024
From Multimodal LLM to Human-level AI: Modality, Instruction, Reasoning, Efficiency and beyond
COLING 2024