Papers
Towards Surveillance Video-and-Language Understanding: New Dataset, Baselines, and Challenges
CVPR 2024
Unveiling Opinion Evolution via Prompting and Diffusion for Short Video Fake News Detection
ACL 2024
PERIA: Perceive, Reason, Imagine, Act via Holistic Language and Vision Planning for Manipulation
NeurIPS 2024
On the Robustness of Language Guidance for Low-Level Vision Tasks: Findings from Depth Estimation
CVPR 2024
Multiple-Question Multiple-Answer Text-VQA
NAACL 2024