Co-occurring keywords
Papers
Cherry-Picking in Time Series Forecasting: How to Select Datasets to Make Your Model Shine
AAAI 2025
SubLIME: Subset Selection via Rank Correlation Prediction for Data-Efficient LLM Evaluation
ACL 2025
SATBench: Benchmarking LLMs’ Logical Reasoning via Automated Puzzle Generation from SAT Formulas
EMNLP 2025
When2Call: When (not) to Call Tools
NAACL 2025
WinoWhat: A Parallel Corpus of Paraphrased WinoGrande Sentences with Common Sense Categorization
ACL 2025