Co-occurring keywords
Papers
MMMU: A Massive Multi-discipline Multimodal Understanding and Reasoning Benchmark for Expert AGI
CVPR 2024
Algorithmic progress in language models
NIPS 2024
MultiTrust: A Comprehensive Benchmark Towards Trustworthy Multimodal Large Language Models
NIPS 2024
CRAG - Comprehensive RAG Benchmark
NIPS 2024