visual question answering
1000 papers
Also known as
VQAI
OK-VQA
VQA
VIDEOQA
TEXTVQA
IMAGEQA
Co-occurring keywords
Papers
GLEN: Generalized Focal Loss Ensemble of Low-Rank Networks for Calibrated Visual Question Answering
AAAI 2025
SpatialLLM: A Compound 3D-Informed Design towards Spatially-Intelligent Large Multimodal Models
CVPR 2025