multimodal learning
4622 papers
Also known as
VLM
VLLM
MM
VLA
MLLMS
MLM
MML
MULLM
LMM
MLLM
MMT
Co-occurring keywords
Papers
ALCAP: Alignment-Augmented Music Captioner
EMNLP 2023
ORANGE: Text-video Retrieval via Watch-time-aware Heterogeneous Graph Contrastive Learning
EMNLP 2023
IC3: Image Captioning by Committee Consensus
EMNLP 2023
GazeVQA: A Video Question Answering Dataset for Multiview Eye-Gaze Task-Oriented Collaborations
EMNLP 2023