Juraj Juraska

16 papers · 2018–2026 · 5 conferences · across top CS/AI conferences

Achievements

+11 more ↓

🌈 Renaissance Researcher (6) 🌉 Interdisciplinary Bridge 🏃 Academic Marathon (7) 🌍 Conference Polyglot (5) 🗺️ Taxonomy Completionist (32)

🗺️ Taxonomy Completionist (32) 🧭 Keyword Pioneer 🐣 Hot Topic Early Bird 👥 Mega-Team (77) 🤝 Dynamic Duo (10) 🏆 Keyword Champion (2) 💎 Century Club (14) 🔥 Unstoppable (5) 🗃️ Keyword Collector (65) ❓ The Questioner ⚡ Prolific Year (5)

Conferences

EMNLP (9) ACL (2) EACL (2) NAACL (2) ICML (1)

Top co-authors

Markus Freitag (12) Mara Finkelstein (11) Daniel Deutsch (11) Geza Kovacs (4) Parker Riley (4) Tobias Domhan (3) Jan-Thorsten Peter (3) David Vilar (3) Kevin Bowden (2) Marilyn Walker (2)

Keywords

machine translation (8) evaluation metric (4) quality estimation (4) large language model (3) synthetic training datum (2) human evaluation (2) translation quality (2) text generation (2) translation quality metric (2) automatic metric (2) dialogue system (2) translation evaluation (2) natural language generation (2) multidimensional quality metrics (2) reward model (1) quality score prediction (1) simulated annealing (1) conversational ai (1) ensemble learning (1) sequence tagging (1)

Papers

Generating Difficult-to-Translate Texts EACL 2026 MQM Re-Annotation: A Technique for Collaborative Evaluation of Machine Translation ACL 2026 Feeding Two Birds or Favoring One? Adequacy–Fluency Tradeoffs in Evaluation and Meta-Evaluation of Machine Translation EMNLP 2025 WMT24++: Expanding the Language Coverage of WMT24 to 55 Languages & Dialects ACL 2025 Google Translate’s Research Submission to WMT2025 EMNLP 2025 MetricX-25 and GemSpanEval: Google Translate Submissions to the WMT25 Evaluation Shared Task EMNLP 2025 From Jack of All Trades to Master of One: Specializing LLM-based Autoraters to a Test Set ICML 2025 MetricX-24: The Google Submission to the WMT 2024 Metrics Shared Task EMNLP 2024 LLMRefine: Pinpointing and Refining Large Language Models via Fine-Grained Actionable Feedback NAACL 2024 Barriers to Effective Evaluation of Simultaneous Interpretation EACL 2024 Training and Meta-Evaluating Machine Translation Evaluation Metrics at the Paragraph Level EMNLP 2023 There’s No Data like Better Data: Using QE Metrics for MT Data Filtering EMNLP 2023 MetricX-23: The Google Submission to the WMT 2023 Metrics Shared Task EMNLP 2023 GEMv2: Multilingual NLG Benchmarking in a Single Line of Code EMNLP 2022 Athena 2.0: Contextualized Dialogue Management for an Alexa Prize SocialBot EMNLP 2021 A Deep Ensemble Model with Slot Alignment for Sequence-to-Sequence Natural Language Generation NAACL 2018