Mikko Aulamo
8 papers · 2018–2025 · 3 conferences · across top CS/AI conferences
Achievements
Jump to papers ↓+2 more ↓ Show less ↑
🧭 Keyword Pioneer 🌍 Conference Polyglot (3) 🏃 Academic Marathon (7) 🌉 Interdisciplinary Bridge 🐝 Cross-Pollinator (14)
🗺️
Taxonomy Completionist
(17)
👥
Mega-Team
(35)
Conferences
ACL (4)
EMNLP (3)
COLING (1)
Top co-authors
Keywords
machine translation
(5)
parallel corpus
(4)
multilingual corpus
(2)
knowledge distillation
(2)
low-resource language
(2)
corpus quality
(2)
automatic speech recognition
(1)
language modeling
(1)
domain adaptation
(1)
neural machine translation
(1)
multilingual translation
(1)
bilingual alignment
(1)
knowledge transfer
(1)
data augmentation
(1)
multitask learning
(1)
language identification
(1)
computational efficiency
(1)
paraphrase detection
(1)
multilingual nlp
(1)
logistic regression
(1)
Papers
An Expanded Massive Multilingual Dataset for High-Performance Language Technologies (HPLT)
ACL 2025
Scaling Low-Resource MT via Synthetic Data Generation with LLMs
EMNLP 2025
A New Massive Multilingual Dataset for High-Performance Language Technologies
COLING 2024
Hybrid Distillation from RBMT and NMT: Helsinki-NLP’s Submission to the Shared Task on Translation into Low-Resource Languages of Spain
EMNLP 2024
Four Approaches to Low-Resource Multilingual NMT: The Helsinki Submission to the AmericasNLP 2023 Shared Task
ACL 2023
OpusFilter: A Configurable Parallel Corpus Filtering Toolbox
ACL 2020
The University of Helsinki Submission to the IWSLT2020 Offline SpeechTranslation Task
ACL 2020
Paraphrase Detection on Noisy Subtitles in Six Languages
EMNLP 2018