Co-occurring keywords
Papers
Transitioning from benchmarks to a real-world case of information-seeking in Scientific Publications
ACL 2023
Is GPT-4 a Good Data Analyst?
EMNLP 2023
SuperTweetEval: A Challenging, Unified and Heterogeneous Benchmark for Social Media NLP Research
EMNLP 2023
Towards Explainable and Accessible AI
EMNLP 2023
The Validity of Evaluation Results: Assessing Concurrence Across Compositionality Benchmarks
CONLL 2023
It’s about Time: Rethinking Evaluation on Rumor Detection Benchmarks using Chronological Splits
EACL 2023