Papers
Exploring Question Guidance and Answer Calibration for Visually Grounded Video Question Answering
EMNLP 2024
GTA: A Benchmark for General Tool Agents
NeurIPS 2024