conftrace_

José Hernández-Orallo

19 papers · 2012–2026 · 6 conferences · across top CS/AI conferences

Achievements

Jump to papers ↓
+10 more ↓ 🏃 Academic Marathon (13) 🌍 Conference Polyglot (6) 🌉 Interdisciplinary Bridge 🧭 Keyword Pioneer 🐝 Cross-Pollinator (6)
🐝 Cross-Pollinator (6) 🌈 Renaissance Researcher (8) 🗺️ Taxonomy Completionist (36) 🏆 Keyword Champion (5) 👥 Mega-Team (27) 🤝 Dynamic Duo (10) The Questioner (2) 💎 Century Club (18) Prolific Year (5) 🗃️ Keyword Collector (87)

Conferences

IJCAI (7) AAAI (4) ACL (3) EACL (2) NIPS (2) JMLR (1)

Papers

TRACE: A Corpus of Team Creative Discussions ACL 2026 Paradigms of AI Evaluation: Mapping Goals, Methodologies and Culture IJCAI 2025 PredictaBoard: Benchmarking LLM Score Predictability ACL 2025 Contamination Budget: Trade-offs Between Breadth, Depth and Difficulty IJCAI 2025 Item Response Theory for Natural Language Processing EACL 2024 A Proposal for Scaling the Scaling Laws EACL 2024 Melting Pot Contest: Charting the Future of Generalized Cooperative Intelligence NIPS 2024 Your Prompt Is My Command: On Assessing the Human-Centred Generality of Multimodal Models (Abstract Reprint) AAAI 2024 Confounders in Instance Variation for the Analysis of Data Contamination ACL 2024 How General-Purpose Is a Language Model? Usefulness and Safety with Human Prompters in the Wild AAAI 2022 Non-Cheating Teaching Revisited: A New Probabilistic Machine Teaching Model IJCAI 2022 Measuring the Occupational Impact of AI: Tasks, Cognitive Abilities and AI Benchmarks (Extended Abstract)* IJCAI 2022 Training on the Test Set: Mapping the System-Problem Space in AI AAAI 2022 When AI Difficulty Is Easy: The Explanatory Power of Predicting IRT Difficulty AAAI 2022 Not a Number: Identifying Instance Features for Capability-Oriented Evaluation IJCAI 2022 Think Big, Teach Small: Do Language Models Distil Occam’s Razor? NIPS 2021 The Facets of Artificial Intelligence: A Framework to Track the Evolution of AI IJCAI 2018 Computer Models Solving Intelligence Test Problems: Progress and Implications (Extended Abstract) IJCAI 2017 A Unified View of Performance Metrics: Translating Threshold Choice into Expected Classification Loss JMLR 2012