Ehud Reiter

23 papers · 2001–2025 · 5 conferences · across top CS/AI conferences

Achievements

+10 more ↓

🌉 Interdisciplinary Bridge 🌈 Renaissance Researcher (7) 🏃 Academic Marathon (24) 🌍 Conference Polyglot (5) 🗺️ Taxonomy Completionist (38)

🏃 Academic Marathon (24) 🗺️ Taxonomy Completionist (38) 🌉 Interdisciplinary Bridge 👥 Mega-Team (42) ❓ The Questioner 🔥 Unstoppable (5) 💎 Century Club (23) 🗃️ Keyword Collector (87) 🚀 Conference Pioneer ⚡ Prolific Year (5)

Conferences

EACL (8) ACL (7) EMNLP (4) NAACL (3) COLING (1)

Top co-authors

Anya Belz (7) Francesco Moramarco (5) Aleksandar Savkov (5) Alex Papadopoulos Korfiatis (4) Mark Perera (3) Craig Thomson (3) Simone Balloccu (3) Diyi Yang (2) Kees van Deemter (2) Alberto Bugarín-Diz (2)

Keywords

human evaluation (5) natural language generation (4) text generation (3) large language model (3) experimental methodology (3) natural language processing (3) nlp research (3) text summarization (2) medical note generation (2) table-to-text generation (2) evaluation methodology (2) bayesian network (2) clinical documentation (2) explainable ai (1) empirical study (1) fact verification (1) dataset creation (1) metric learning (1) dialogue generation (1) affective computing (1)

Papers

CausalGraphBench: a Benchmark for Evaluating Language Models capabilities of Causal Graph discovery ACL 2025 Evolving Stances on Reproducibility: A Longitudinal Study of NLP and ML Researchers’ Views and Experience of Reproducibility EMNLP 2025 Scalability of Bayesian Network Structure Elicitation with Large Language Models: a Novel Methodology and Comparative Analysis COLING 2025 SPHERE: An Evaluation Card for Human-AI Systems ACL 2025 Linguistically Communicating Uncertainty in Patient-Facing Risk Prediction Models EACL 2024 Improving Factual Accuracy of Neural Table-to-Text Output by Addressing Input Problems in ToTTo NAACL 2024 Ask the experts: sourcing a high-quality nutrition counseling dataset through Human-AI collaboration EMNLP 2024 Are Experts Needed? On Human Evaluation of Counselling Reflection Generation ACL 2023 Missing Information, Unresponsive Authors, Experimental Flaws: The Impossibility of Assessing the Reproducibility of Previous Human Evaluations in NLP EACL 2023 Non-Repeatable Experiments and Non-Reproducible Results: The Reproducibility Crisis in Human Evaluation in NLP ACL 2023 Consultation Checklists: Standardising the Human Evaluation of Medical Note Generation EMNLP 2022 Error Analysis of ToTTo Table-to-Text Neural NLG Models EMNLP 2022 Human Evaluation and Correlation with Automatic Metrics in Consultation Note Generation ACL 2022 Beyond calories: evaluating how tailored communication reduces emotional load in diet-coaching ACL 2022 User-Driven Research of Medical Note Generation Software NAACL 2022 A Systematic Review of Reproducibility Research in Natural Language Processing EACL 2021 Towards Objectively Evaluating the Quality of Generated Medical Summaries EACL 2021 A Preliminary Study on Evaluating Consultation Notes With Post-Editing EACL 2021 Generating Expressions that Refer to Visible Objects NAACL 2013 Generating Spatio-Temporal Descriptions in Pollen Forecasts EACL 2006 Comparing Automatic and Human Evaluation of NLG Systems EACL 2006 Summarizing Neonatal Time Series Data EACL 2003 Using a Randomised Controlled Clinical Trial to Evaluate an NLG System ACL 2001