Craig Thomson
8 papers · 2022–2025 · 4 conferences · across top CS/AI conferences
Achievements
Jump to papers ↓+2 more ↓ Show less ↑
🌉 Interdisciplinary Bridge 🐝 Cross-Pollinator (14) 🌈 Renaissance Researcher (5) 🌍 Conference Polyglot (4) 🗺️ Taxonomy Completionist (16)
🧭
Keyword Pioneer
👥
Mega-Team
(77)
Conferences
ACL (4)
EMNLP (2)
COLING (1)
EACL (1)
Top co-authors
Keywords
human evaluation
(3)
natural language processing
(3)
nlp evaluation
(3)
reproducibility assessment
(2)
evaluation methodology
(2)
nlp research
(2)
experimental methodology
(2)
evaluation benchmark
(1)
evaluation metric
(1)
evaluation protocol
(1)
shared task
(1)
text generation evaluation
(1)
peer review
(1)
reproducibility study
(1)
quality criterion
(1)
benchmark design
(1)
survey research
(1)
evaluation taxonomy
(1)
regulation compliance
(1)
machine learning evaluation
(1)
Papers
Evolving Stances on Reproducibility: A Longitudinal Study of NLP and ML Researchers’ Views and Experience of Reproducibility
EMNLP 2025
Standard Quality Criteria Derived from Current NLP Evaluations for Guiding Evaluation Design and Grounding Comparability and AI Compliance Assessments
ACL 2025
HEDS 3.0: The Human Evaluation Data Sheet Version 3.0
ACL 2025
The 2025 ReproNLP Shared Task on Reproducibility of Evaluations in NLP: Overview and Results
ACL 2025
The 2024 ReproNLP Shared Task on Reproducibility of Evaluations in NLP: Overview and Results
COLING 2024
Missing Information, Unresponsive Authors, Experimental Flaws: The Impossibility of Assessing the Reproducibility of Previous Human Evaluations in NLP
EACL 2023
Non-Repeatable Experiments and Non-Reproducible Results: The Reproducibility Crisis in Human Evaluation in NLP
ACL 2023
GEMv2: Multilingual NLG Benchmarking in a Single Line of Code
EMNLP 2022