multimodal learning
4622 papers
Also known as
VLM
VLLM
MM
VLA
MLLMS
MLM
MML
MULLM
LMM
MLLM
MMT
Co-occurring keywords
Papers
SaSR-Net: Source-Aware Semantic Representation Network for Enhancing Audio-Visual Question Answering
EMNLP 2024
WikiScenes with Descriptions: Aligning Paragraphs and Sentences with Images in Wikipedia Articles
NAACL 2024
LTRC-IIITH at MEDIQA-M3G 2024: Medical Visual Question Answering with Vision-Language Models
NAACL 2024
WangLab at MEDIQA-M3G 2024: Multimodal Medical Answer Generation using Large Language Models
NAACL 2024
MemeGuard: An LLM and VLM-based Framework for Advancing Content Moderation via Meme Intervention
ACL 2024
Synthetic Multimodal Question Generation
EMNLP 2024