Improving Long-Context Translation via Self-Supervised Dual Learning

Shanbo Cheng; Shuaijie She; Yu Bao; Jianbing Zhang; Jiajun Chen; Shujian Huang

2026 ACL ACL 2026

Improving Long-Context Translation via Self-Supervised Dual Learning

Abstract

AbstractLarge language models (LLMs) with long context windows offer the potential to translate entire documents in a single pass, yet they frequently suffer from catastrophic information distortion, undermining the strict faithfulness required for translation. This challenge is compounded by the scarcity of document-level parallel data, which makes both supervised fine-tuning and reliable evaluation prohibitively expensive. We propose LongDu, a self-supervised post-training framework that improves long-document translation reliability via round-trip consistency. Given monolingual documents, LongDu samples multiple candidate translations, back-translates each candidate, and optimizes the model to prefer translations that best reconstruct the source. To make this signal robust for long-form generation, we design a reward that filters trivial failure modes (e.g., copying and local language drift) before applying a reconstruction and fluency score, enabling stable reinforcement learning without human annotations. We additionally introduce Long-CIRT, an automatic evaluation protocol that quantifies information distortion by measuring how much a LLM’s performance degrades after a translation cycle. Across multiple base models, LongDu substantially improves information retention and translation quality, with gains that generalize beyond the training length range and to unseen target languages.

Authors

Shanbo Cheng , Shuaijie She , Yu Bao , Jianbing Zhang , Jiajun Chen , Shujian Huang

Topics

Natural Language Processing > Applications > Machine Translation Artificial Intelligence > Core AI > Large Language Models Deep Learning > Learning Types > Self-Supervised Learning

Keywords

self-supervised learning round-trip consistency information retention large language model long-document translation

Download PDF

Related papers

No Reader Left Behind: Multi-Agent Summaries Everyone Can Understand 2026

One-step Nonautoregressive Natural Language Generation with Shortcut Flow Matching Models 2026

Optimizing Retrieval-Augmented Generation for E-Commerce How-To Assistance 2026

Make Mechanistic Interpretability Auditable: A Call to Develop Guidelines via Continuous Collaborative Reviewing 2026

MQM Re-Annotation: A Technique for Collaborative Evaluation of Machine Translation 2026