visual question answering
1000 papers
Also known as
VQAI
OK-VQA
VQA
VIDEOQA
TEXTVQA
IMAGEQA
Papers
“Image, Tell me your story!” Predicting the original meta-context of visual misinformation
EMNLP 2024
What If the TV Was Off? Examining Counterfactual Reasoning Abilities of Multi-modal Language Models
CVPR 2024
Negative Object Presence Evaluation (NOPE) to Measure Object Hallucination in Vision-Language Models
ACL 2024
Causal-CoG: A Causal-Effect Look at Context Generation for Boosting Multi-modal Language Models
CVPR 2024
CoG-DQA: Chain-of-Guiding Learning with Large Language Models for Diagram Question Answering
CVPR 2024