Yves Scherrer
46 papers · 2007–2026 · 5 conferences · across top CS/AI conferences
Achievements
Jump to papers ↓+12 more ↓ Show less ↑
🏃 Academic Marathon (18) 🌉 Interdisciplinary Bridge 🧭 Keyword Pioneer 🌍 Conference Polyglot (5) 🐝 Cross-Pollinator (8)
🌍
Conference Polyglot
(5)
🏃
Academic Marathon
(18)
🌈
Renaissance Researcher
(6)
🐺
Lone Wolf
(3)
🏆
Keyword Champion
(2)
🔬
Deep Specialist
(14)
🤝
Dynamic Duo
(12)
💎
Century Club
(45)
🗃️
Keyword Collector
(177)
⚡
Prolific Year
(6)
🔥
Unstoppable
(8)
📈
Trend Setter
Conferences
EMNLP (16)
ACL (9)
COLING (9)
NAACL (7)
EACL (5)
Top co-authors
Research topics
Keywords
neural machine translation
(13)
multilingual nlp
(8)
dialect identification
(8)
language identification
(7)
low-resource language
(6)
machine translation
(6)
intent detection
(4)
shared task
(4)
knowledge distillation
(3)
multilingual translation
(3)
language variety
(3)
low-resource translation
(3)
data augmentation
(3)
multilingual model
(3)
norwegian dialect
(3)
transfer learning
(2)
representation learning
(2)
subword tokenization
(2)
social media
(2)
parallel corpus
(2)
Papers
OpenLID-v3: Improving the Precision of Closely Related Language Identification – An Experience Report
EACL 2026
Improved Norwegian Bokmål Translations for FLORES
EMNLP 2025
EdinHelsOW WMT 2025 CreoleMT System Description: Improving Lusophone Creole Translation through Data Augmentation, Model Merging and LLM Post-editing
EMNLP 2025
LTG at VarDial 2025 NorSID: More and Better Training Data for Slot and Intent Detection
COLING 2025
Findings of the VarDial Evaluation Campaign 2025: The NorSID Shared Task on Norwegian Slot, Intent and Dialect Identification
COLING 2025
Functional Lexicon in Subword Tokenization
NAACL 2025
Dialects, Topic Models, and Border Effects: The Rusyn Case
ACL 2025
Explaining novel senses using definition generation with open language models
EMNLP 2025
Hybrid Distillation from RBMT and NMT: Helsinki-NLP’s Submission to the Shared Task on Translation into Low-Resource Languages of Spain
EMNLP 2024
Definition generation for lexical semantic change detection
ACL 2024
VarDial Evaluation Campaign 2024: Commonsense Reasoning in Dialects and Multi-Label Similar Language Identification
NAACL 2024
System Description of the NordicsAlps Submission to the AmericasNLP 2024 Machine Translation Shared Task
NAACL 2024
NoMusic - The Norwegian Multi-Dialectal Slot and Intent Detection Corpus
NAACL 2024
Character alignment methods for dialect-to-standard normalization
ACL 2023
The Helsinki-NLP Submissions at NADI 2023 Shared Task: Walking the Baseline
EMNLP 2023
Dialect-to-Standard Normalization: A Large-Scale Multilingual Evaluation
EMNLP 2023
Dialect Representation Learning with Neural Dialect-to-Standard Normalization
EACL 2023
Findings of the VarDial Evaluation Campaign 2023
EACL 2023
Four Approaches to Low-Resource Multilingual NMT: The Helsinki Submission to the AmericasNLP 2023 Shared Task
ACL 2023
Changing usage of Low Saxon auxiliary and modal verbs
EMNLP 2023
Low Saxon dialect distances at the orthographic and syntactic level
ACL 2022
Findings of the VarDial Evaluation Campaign 2022
COLING 2022
OcWikiDisc: a Corpus of Wikipedia Talk Pages in Occitan
COLING 2022
Sesame Street to Mount Sinai: BERT-constrained character-level Moses models for multilingual lexical normalization
EMNLP 2021
The Helsinki submission to the AmericasNLP shared task
NAACL 2021
Findings of the VarDial Evaluation Campaign 2021
EACL 2021
Social Media Variety Geolocation with geoBERT
EACL 2021
The MUCOW word sense disambiguation test suite at WMT 2020
EMNLP 2020
A Report on the VarDial Evaluation Campaign 2020
COLING 2020
LSDC - A comprehensive dataset for Low Saxon Dialect Classification
COLING 2020
HeLju@VarDial 2020: Social Media Variety Geolocation with BERT Models
COLING 2020
Fixed Encoder Self-Attention Patterns in Transformer-Based Machine Translation
EMNLP 2020
The University of Helsinki and Aalto University submissions to the WMT 2020 news and low-resource translation tasks
EMNLP 2020
The University of Helsinki Submissions to the WMT19 News Translation Task
ACL 2019
Analysing concatenation approaches to document-level NMT in two different domains
EMNLP 2019
Measuring Semantic Abstraction of Multilingual NMT with Paraphrase Recognition and Generation Tasks
NAACL 2019
A Report on the Third VarDial Evaluation Campaign
NAACL 2019
The University of Helsinki Submissions to the WMT19 Similar Language Translation Task
ACL 2019
The MuCoW Test Suite at WMT 2019: Automatically Harvested Multilingual Contrastive Word Sense Disambiguation Test Sets for Machine Translation
ACL 2019
The WMT’18 Morpheval test suites for English-Czech, English-German, English-Finnish and Turkish-English
EMNLP 2018
Language Identification and Morphosyntactic Tagging: The Second VarDial Evaluation Campaign
COLING 2018
The University of Helsinki submissions to the WMT18 news task
EMNLP 2018
The University of Helsinki submissions to the IWSLT 2018 low-resource translation task
EMNLP 2018
On-line Multilingual Linguistic Services
COLING 2016
Word-Based Dialect Identification with Georeferenced Rules
EMNLP 2010
Adaptive String Distance Measures for Bilingual Dialect Lexicon Induction
ACL 2007