visual question answering
1000 papers
Also known as
VQAI
OK-VQA
VQA
VIDEOQA
TEXTVQA
IMAGEQA
Papers
End-to-End Multi-Modal Diffusion Mamba
ICCV 2025
Ask and Remember: A Questions-Only Replay Strategy for Continual Visual Question Answering
ICCV 2025
Multimodal Commonsense Knowledge Distillation for Visual Question Answering (Student Abstract)
AAAI 2025
WebMMU: A Benchmark for Multimodal Multilingual Website Understanding and Code Generation
EMNLP 2025
GEMeX: A Large-Scale, Groundable, and Explainable Medical VQA Benchmark for Chest X-ray Diagnosis
ICCV 2025