MQM Re-Annotation: A Technique for Collaborative Evaluation of Machine Translation

Parker Riley; Daniel Deutsch; Mara Finkelstein; Colten DiIanni; Juraj Juraska; Markus Freitag

2026 ACL ACL 2026

MQM Re-Annotation: A Technique for Collaborative Evaluation of Machine Translation

Abstract

AbstractHuman evaluation of machine translation is in an arms race with translation model quality: as our models get better, our evaluation methods need to be improved to ensure that quality gains are not lost in evaluation noise. To improve annotation quality, we experiment with a two-stage version of the current state-of-the-art translation evaluation paradigm (MQM), which we call MQM re-annotation. In this setup, an annotator reviews and edits a set of prior MQM annotations that may have come from themselves, another human annotator, or an automatic system. We demonstrate that rater behavior in re-annotation aligns with our goals, and that re-annotation results in higher-quality annotations, mostly due to finding errors that were missed during the first pass.

Authors

Parker Riley , Daniel Deutsch , Mara Finkelstein , Colten DiIanni , Juraj Juraska , Markus Freitag

Topics

Natural Language Processing > Applications > Machine Translation Natural Language Processing > Applications > Evaluation

Keywords

machine translation human evaluation annotation quality translation evaluation multidimensional quality metrics

Download PDF

Related papers

No Reader Left Behind: Multi-Agent Summaries Everyone Can Understand 2026

One-step Nonautoregressive Natural Language Generation with Shortcut Flow Matching Models 2026

Optimizing Retrieval-Augmented Generation for E-Commerce How-To Assistance 2026

Make Mechanistic Interpretability Auditable: A Call to Develop Guidelines via Continuous Collaborative Reviewing 2026

MT3: A Synergistic Multi-Task RL Framework for Specializing MLLMs in Text Image Machine Translation 2026