conftrace_

Ehud Reiter

23 papers · 2001–2025 · 5 conferences · across top CS/AI conferences

Achievements

🌉 Interdisciplinary Bridge · 🌈 Renaissance Researcher (7) · 🏃 Academic Marathon (24) · 🌍 Conference Polyglot (5) · 🗺️ Taxonomy Completionist (38) · 👥 Mega-Team (42) · ❓ The Questioner · 🔥 Unstoppable (5) · 💎 Century Club (23) · 🗃️ Keyword Collector (87) · 🚀 Conference Pioneer · ⚑ Prolific Year (5)

Conferences

EACL (8) ACL (7) EMNLP (4) NAACL (3) COLING (1)

Papers

CausalGraphBench: a Benchmark for Evaluating Language Models capabilities of Causal Graph discovery · ACL 2025
Evolving Stances on Reproducibility: A Longitudinal Study of NLP and ML Researchers’ Views and Experience of Reproducibility · EMNLP 2025
Scalability of Bayesian Network Structure Elicitation with Large Language Models: a Novel Methodology and Comparative Analysis · COLING 2025
SPHERE: An Evaluation Card for Human-AI Systems · ACL 2025
Linguistically Communicating Uncertainty in Patient-Facing Risk Prediction Models · EACL 2024
Improving Factual Accuracy of Neural Table-to-Text Output by Addressing Input Problems in ToTTo · NAACL 2024
Ask the experts: sourcing a high-quality nutrition counseling dataset through Human-AI collaboration · EMNLP 2024
Are Experts Needed? On Human Evaluation of Counselling Reflection Generation · ACL 2023
Missing Information, Unresponsive Authors, Experimental Flaws: The Impossibility of Assessing the Reproducibility of Previous Human Evaluations in NLP · EACL 2023
Non-Repeatable Experiments and Non-Reproducible Results: The Reproducibility Crisis in Human Evaluation in NLP · ACL 2023
Consultation Checklists: Standardising the Human Evaluation of Medical Note Generation · EMNLP 2022
Error Analysis of ToTTo Table-to-Text Neural NLG Models · EMNLP 2022
Human Evaluation and Correlation with Automatic Metrics in Consultation Note Generation · ACL 2022
Beyond calories: evaluating how tailored communication reduces emotional load in diet-coaching · ACL 2022
User-Driven Research of Medical Note Generation Software · NAACL 2022
A Systematic Review of Reproducibility Research in Natural Language Processing · EACL 2021
Towards Objectively Evaluating the Quality of Generated Medical Summaries · EACL 2021
A Preliminary Study on Evaluating Consultation Notes With Post-Editing · EACL 2021
Generating Expressions that Refer to Visible Objects · NAACL 2013
Generating Spatio-Temporal Descriptions in Pollen Forecasts · EACL 2006
Comparing Automatic and Human Evaluation of NLG Systems · EACL 2006
Summarizing Neonatal Time Series Data · EACL 2003
Using a Randomised Controlled Clinical Trial to Evaluate an NLG System · ACL 2001