multimodal learning
4622 papers
Also known as
VLM
VLLM
MM
VLA
MLLMS
MLM
MML
MULLM
LMM
MLLM
MMT
Co-occurring keywords
Papers
monoVLN: Bridging the Observation Gap between Monocular and Panoramic Vision and Language Navigation
ICCV 2025
ExpertNeurons at SciVQA-2025: Retrieval Augmented VQA with Vision Language Model (RAVQA-VLM)
ACL 2025
How Would It Sound? Material-Controlled Multimodal Acoustic Profile Generation for Indoor Scenes
ICCV 2025
InfiMM-WebMath-40B: Advancing Multimodal Pre-Training for Enhanced Mathematical Reasoning
EMNLP 2025
HintsOfTruth: A Multimodal Checkworthiness Detection Dataset with Real and Synthetic Claims
ACL 2025
What Is That Talk About? A Video-to-Text Summarization Dataset for Scientific Presentations
ACL 2025