Papers
A Picture is Worth More Than 77 Text Tokens: Evaluating CLIP-Style Models on Dense Captions
CVPR 2024
Evaluating Computational Representations of Character: An Austen Character Similarity Benchmark
EMNLP 2024
ERVQA: A Dataset to Benchmark the Readiness of Large Vision Language Models in Hospital Environments
EMNLP 2024