Co-occurring keywords
Papers
ALCAP: Alignment-Augmented Music Captioner
EMNLP 2023
ORANGE: Text-video Retrieval via Watch-time-aware Heterogeneous Graph Contrastive Learning
EMNLP 2023
IC3: Image Captioning by Committee Consensus
EMNLP 2023
GazeVQA: A Video Question Answering Dataset for Multiview Eye-Gaze Task-Oriented Collaborations
EMNLP 2023