conftrace_

Craig Thomson

8 papers · 2022–2025 · 4 conferences · across top CS/AI conferences

Achievements

🌉 Interdisciplinary Bridge 🐝 Cross-Pollinator (14) 🌈 Renaissance Researcher (5) 🌍 Conference Polyglot (4) 🗺️ Taxonomy Completionist (16)

🧭 Keyword Pioneer 👥 Mega-Team (77)

Conferences

ACL (4) EMNLP (2) COLING (1) EACL (1)

Top co-authors

Anya Belz (7) Simon Mille (3) Ehud Reiter (3) João Sedoc (2) Ondřej Dušek (2) Saad Mahamood (2) Elizabeth Clark (2) Dimitra Gkatzia (2) Yiru Li (1) Qi Zhu (1)

Keywords

human evaluation (3) natural language processing (3) nlp evaluation (3) reproducibility assessment (2) evaluation methodology (2) nlp research (2) experimental methodology (2) evaluation benchmark (1) evaluation metric (1) evaluation protocol (1) shared task (1) text generation evaluation (1) peer review (1) reproducibility study (1) quality criterion (1) benchmark design (1) survey research (1) evaluation taxonomy (1) regulation compliance (1) machine learning evaluation (1)

Papers

Evolving Stances on Reproducibility: A Longitudinal Study of NLP and ML Researchers’ Views and Experience of Reproducibility · EMNLP 2025
Standard Quality Criteria Derived from Current NLP Evaluations for Guiding Evaluation Design and Grounding Comparability and AI Compliance Assessments · ACL 2025
HEDS 3.0: The Human Evaluation Data Sheet Version 3.0 · ACL 2025
The 2025 ReproNLP Shared Task on Reproducibility of Evaluations in NLP: Overview and Results · ACL 2025
The 2024 ReproNLP Shared Task on Reproducibility of Evaluations in NLP: Overview and Results · COLING 2024
Missing Information, Unresponsive Authors, Experimental Flaws: The Impossibility of Assessing the Reproducibility of Previous Human Evaluations in NLP · EACL 2023
Non-Repeatable Experiments and Non-Reproducible Results: The Reproducibility Crisis in Human Evaluation in NLP · ACL 2023
GEMv2: Multilingual NLG Benchmarking in a Single Line of Code · EMNLP 2022