Co-occurring keywords
Papers
Don’t Buy it! Reassessing the Ad Understanding Abilities of Contrastive Multimodal Models
ACL 2024
ZEETAD: Adapting Pretrained Vision-Language Model for Zero-Shot End-to-End Temporal Action Detection
WACV 2024
Detecting Temporal Ambiguity in Questions
EMNLP 2024