multimodal learning
4622 papers
Also known as
VLM
VLLM
MM
VLA
MLLMS
MLM
MML
MULLM
LMM
MLLM
MMT
Co-occurring keywords
Papers
Grounding Visual Illusions in Language: Do Vision-Language Models Perceive Illusions Like Humans?
EMNLP 2023
Reading Order Matters: Information Extraction from Visually-rich Documents by Token Path Prediction
EMNLP 2023
DetGPT: Detect What You Need via Reasoning
EMNLP 2023
UniChart: A Universal Vision-language Pretrained Model for Chart Comprehension and Reasoning
EMNLP 2023