Papers
Evaluating Multimodal Large Language Models on Video Captioning via Monte Carlo Tree Search
ACL 2025
Progress-Aware Video Frame Captioning
CVPR 2025
IFCap: Image-like Retrieval and Frequency-based Entity Filtering for Zero-shot Captioning
EMNLP 2024
VIEWS: Entity-Aware News Video Captioning
EMNLP 2024