Co-occurring keywords
Papers
How Robust are Model Rankings : A Leaderboard Customization Approach for Equitable Evaluation
AAAI 2021
Comparing Test Sets with Item Response Theory
IJCNLP 2021
Does my multimodal model learn cross-modal interactions? It’s harder to tell than you might think!
EMNLP 2020