Sami Virpioja
20 papers · 2009–2025 · 7 conferences · across top CS/AI conferences
Achievements
🏃 Academic Marathon (16)
🧭 Keyword Pioneer
🌉 Interdisciplinary Bridge
🌍 Conference Polyglot (7)
🐝 Cross-Pollinator (8)
🗺️ Taxonomy Completionist (28)
🤝 Dynamic Duo (11)
🔥 Unstoppable (7)
🗃️ Keyword Collector (70)
💎 Century Club (20)
Conferences
ACL (4)
EMNLP (4)
NAACL (4)
EACL (3)
INTERSPEECH (3)
COLING (1)
CONLL (1)
Research topics
Keywords
neural machine translation (6)
low-resource language (2)
morphological segmentation (2)
parallel corpus (2)
transfer learning (2)
multilingual translation (2)
data filtering (2)
subword segmentation (2)
machine translation (1)
information retrieval (1)
knowledge distillation (1)
speech recognition (1)
bilingual alignment (1)
document-level translation (1)
logistic regression (1)
syntactic information (1)
embedding similarity (1)
data augmentation (1)
semantic similarity (1)
language identification (1)
Papers
Implementing and Evaluating Multi-source Retrieval-Augmented Translation
EMNLP 2025
Four Approaches to Low-Resource Multilingual NMT: The Helsinki Submission to the AmericasNLP 2023 Shared Task
ACL 2023
Morfessor-enriched features and multilingual training for canonical morphological segmentation
NAACL 2022
The Helsinki submission to the AmericasNLP shared task
NAACL 2021
Controlling the Imprint of Passivization and Negation in Contextualized Representations
EMNLP 2020
OpusFilter: A Configurable Parallel Corpus Filtering Toolbox
ACL 2020
The University of Helsinki and Aalto University submissions to the WMT 2020 news and low-resource translation tasks
EMNLP 2020
FinChat: Corpus and Evaluation Setup for Finnish Chat Conversations on Everyday Topics
INTERSPEECH 2020
Subword RNNLM Approximations for Out-Of-Vocabulary Keyword Search
INTERSPEECH 2019
The University of Helsinki Submissions to the WMT19 News Translation Task
ACL 2019
The University of Helsinki Submissions to the WMT19 Similar Language Translation Task
ACL 2019
Cognate-aware morphological segmentation for multilingual neural translation
EMNLP 2018
Improved Subword Modeling for WFST-Based Speech Recognition
INTERSPEECH 2017
Morfessor 2.0: Toolkit for statistical morphological segmentation
EACL 2014
Morfessor FlatCat: An HMM-Based Method for Unsupervised and Semi-Supervised Learning of Morphology
COLING 2014
Painless Semi-Supervised Morphological Segmentation using Conditional Random Fields
EACL 2014
Supervised Morphological Segmentation in a Low-Resource Learning Setting using Conditional Random Fields
CONLL 2013
Minimum Bayes Risk Combination of Translation Hypotheses from Alternative Morphological Decompositions
NAACL 2009
Morpho Challenge - Evaluation of algorithms for unsupervised learning of morphology in various tasks and languages
NAACL 2009
Web Augmentation of Language Models for Continuous Speech Recognition of SMS Text Messages
EACL 2009