visual question answering
1000 papers
Also known as
VQAI
OK-VQA
VQA
VIDEOQA
TEXTVQA
IMAGEQA
Co-occurring keywords
Papers
Automated Generation of Challenging Multiple-Choice Questions for Vision Language Model Evaluation
CVPR 2025
GroundingFace: Fine-grained Face Understanding via Pixel Grounding Multimodal Large Language Model
CVPR 2025
AdaDARE-gamma: Balancing Stability and Plasticity in Multi-modal LLMs through Efficient Adaptation
CVPR 2025
VRSBench: A Versatile Vision-Language Benchmark Dataset for Remote Sensing Image Understanding
NIPS 2024
Multi-Level Information Retrieval Augmented Generation for Knowledge-based Visual Question Answering
EMNLP 2024