visual question answering
1000 papers
Also known as
VQAI
OK-VQA
VQA
VIDEOQA
TEXTVQA
IMAGEQA
Co-occurring keywords
Papers
Forensics-Bench: A Comprehensive Forgery Detection Benchmark Suite for Large Vision Language Models
CVPR 2025
Multimodal Commonsense Knowledge Distillation for Visual Question Answering (Student Abstract)
AAAI 2025
VisFinEval: A Scenario-Driven Chinese Multimodal Benchmark for Holistic Financial Understanding
EMNLP 2025
WebMMU: A Benchmark for Multimodal Multilingual Website Understanding and Code Generation
EMNLP 2025