visual question answering
1000 papers
Also known as
VQAI
OK-VQA
VQA
VIDEOQA
TEXTVQA
IMAGEQA
Papers
BanglaProtha: Evaluating Vision Language Models in Underrepresented Long-tail Cultural Contexts
WACV 2026
LLaVA³: Representing 3D Scenes Like a Cubist Painter to Boost 3D Scene Understanding of VLMs
AAAI 2026
DRIVINGVQA: A Dataset for Interleaved Visual Chain-of-Thought in Real-World Driving Scenarios
EACL 2026
MangaVQA and MangaLMM: A Benchmark and Specialized Model for Multimodal Manga Understanding
EACL 2026
Visual–Linguistic Abductive Reasoning with LLMs for Knowledge-based Visual Question Answering
EACL 2026
Relevance-aware Multi-context Contrastive Decoding for Retrieval-augmented Visual Question Answering
WACV 2026