multimodal learning
4622 papers
Also known as
VLM
VLLM
MM
VLA
MLLMS
MLM
MML
MULLM
LMM
MLLM
MMT
Papers
Does Visual Grounding Enhance the Understanding of Embodied Knowledge in Large Language Models?
EMNLP 2025
Judge and Improve: Towards a Better Reasoning of Knowledge Graphs with Large Language Models
EMNLP 2025
ViDoRAG: Visual Document Retrieval-Augmented Generation via Dynamic Iterative Reasoning Agents
EMNLP 2025
AI Knows Where You Are: Exposure, Bias, and Inference in Multimodal Geolocation with KoreaGEO
EMNLP 2025
Stop Looking for “Important Tokens” in Multimodal Language Models: Duplication Matters More
EMNLP 2025
Evaluating LLM-Generated Diagrams as Graphs
EMNLP 2025