2018
ACL
ACL 2018
Marian: Cost-effective High-Quality Neural Machine Translation in C++
Abstract
AbstractThis paper describes the submissions of the โMarianโ team to the WNMT 2018 shared task. We investigate combinations of teacher-student training, low-precision matrix products, auto-tuning and other methods to optimize the Transformer model on GPU and CPU. By further integrating these methods with the new averaging attention networks, a recently introduced faster Transformer variant, we create a number of high-quality, high-performance models on the GPU and CPU, dominating the Pareto frontier for this shared task.
๐
Interdisciplinary Bridge
- Artificial Intelligence and Machine Learning and Natural Language Processing
๐
Trend Setter
- Model Compression
๐งญ
Keyword Pioneer
- transformer model
๐ฃ
Hot Topic Early Bird
- neural machine translation
๐
Cross-Pollinator
- Artificial Intelligence, Computer Vision, Deep Learning, Healthcare & Medicine, Interdisciplinary, Machine Learning, Mathematics & Optimization, Natural Language Processing, Reinforcement Learning, Speech & Audio
Authors
Topics
Artificial Intelligence > Core AI > Model Compression
Machine Learning > Application Areas > Efficient Computing
Natural Language Processing > Applications > Machine Translation
Deep Learning > Optimization & Theory > Model Compression
Deep Learning > Learning Types > Knowledge Distillation
Deep Learning > Optimization & Theory > Efficient Computing