Papers
BCAmirs at SemEval-2024 Task 4: Beyond Words: A Multimodal and Multilingual Exploration of Persuasion in Memes
SemEval 2024
JourneyBench: A Challenging One-Stop Vision-Language Understanding Benchmark of Generated Images
NeurIPS 2024
IFCap: Image-like Retrieval and Frequency-based Entity Filtering for Zero-shot Captioning
EMNLP 2024
Translating speech with just images
INTERSPEECH 2024
Segment and Caption Anything
CVPR 2024