Papers
AbsVis – Benchmarking How Humans and Vision-Language Models “See” Abstract Concepts in Images
EMNLP 2025
VidChain: Chain-of-Tasks with Metric-based Direct Preference Optimization for Dense Video Captioning
AAAI 2025