Juraj Juraska
16 papers · 2018–2026 · 5 top CS/AI conferences
Achievements
🌈 Renaissance Researcher (6)
🌉 Interdisciplinary Bridge
🏃 Academic Marathon (7)
🌍 Conference Polyglot (5)
🗺️ Taxonomy Completionist (32)
🧭 Keyword Pioneer
🐣 Hot Topic Early Bird
👥 Mega-Team (77)
🤝 Dynamic Duo (10)
🏆 Keyword Champion (2)
💎 Century Club (14)
🔥 Unstoppable (5)
🗃️ Keyword Collector (65)
❓ The Questioner
⚡ Prolific Year (5)
Conferences
EMNLP (9)
ACL (2)
EACL (2)
NAACL (2)
ICML (1)
Keywords
machine translation (8)
evaluation metric (4)
quality estimation (4)
large language model (3)
synthetic training data (2)
human evaluation (2)
translation quality (2)
text generation (2)
translation quality metric (2)
automatic metric (2)
dialogue system (2)
translation evaluation (2)
natural language generation (2)
multidimensional quality metrics (2)
reward model (1)
quality score prediction (1)
simulated annealing (1)
conversational AI (1)
ensemble learning (1)
sequence tagging (1)
Papers
Generating Difficult-to-Translate Texts
EACL 2026
MQM Re-Annotation: A Technique for Collaborative Evaluation of Machine Translation
ACL 2026
Feeding Two Birds or Favoring One? Adequacy–Fluency Tradeoffs in Evaluation and Meta-Evaluation of Machine Translation
EMNLP 2025
WMT24++: Expanding the Language Coverage of WMT24 to 55 Languages & Dialects
ACL 2025
Google Translate’s Research Submission to WMT2025
EMNLP 2025
MetricX-25 and GemSpanEval: Google Translate Submissions to the WMT25 Evaluation Shared Task
EMNLP 2025
From Jack of All Trades to Master of One: Specializing LLM-based Autoraters to a Test Set
ICML 2025
MetricX-24: The Google Submission to the WMT 2024 Metrics Shared Task
EMNLP 2024
LLMRefine: Pinpointing and Refining Large Language Models via Fine-Grained Actionable Feedback
NAACL 2024
Barriers to Effective Evaluation of Simultaneous Interpretation
EACL 2024
Training and Meta-Evaluating Machine Translation Evaluation Metrics at the Paragraph Level
EMNLP 2023
There’s No Data like Better Data: Using QE Metrics for MT Data Filtering
EMNLP 2023
MetricX-23: The Google Submission to the WMT 2023 Metrics Shared Task
EMNLP 2023
GEMv2: Multilingual NLG Benchmarking in a Single Line of Code
EMNLP 2022
Athena 2.0: Contextualized Dialogue Management for an Alexa Prize SocialBot
EMNLP 2021
A Deep Ensemble Model with Slot Alignment for Sequence-to-Sequence Natural Language Generation
NAACL 2018