Jörg Tiedemann
75 papers · 2003–2026 · 8 conferences · across top CS/AI conferences
Achievements
Jump to papers ↓+15 more ↓ Show less ↑
🌍 Conference Polyglot (8) 🐣 Hot Topic Early Bird 🧭 Keyword Pioneer 🌉 Interdisciplinary Bridge 🏃 Academic Marathon (22)
🧭
Keyword Pioneer
🐣
Hot Topic Early Bird
🏃
Academic Marathon
(22)
🏠
Conference Loyalist
(23)
🐺
Lone Wolf
(6)
🤝
Dynamic Duo
(19)
👥
Mega-Team
(35)
🔬
Deep Specialist
(29)
❓
The Questioner
(5)
🚀
Conference Pioneer
⚡
Prolific Year
(8)
🗃️
Keyword Collector
(221)
📈
Trend Setter
🔥
Unstoppable
(9)
💎
Century Club
(74)
Conferences
EMNLP (23)
ACL (16)
COLING (15)
NAACL (7)
EACL (6)
SEMEVAL (4)
CONLL (2)
IJCNLP (2)
Top co-authors
Research topics
Keywords
machine translation
(21)
neural machine translation
(16)
multilingual nlp
(7)
multilingual model
(7)
parallel corpus
(6)
transfer learning
(6)
low-resource language
(6)
multilingual translation
(6)
text classification
(5)
attention mechanism
(5)
hallucination detection
(4)
language model
(4)
document-level translation
(3)
sentiment analysis
(3)
benchmark evaluation
(3)
translation quality
(3)
knowledge distillation
(3)
paraphrase generation
(3)
pretrained language model
(3)
multilingual neural machine translation
(3)
Papers
Test-Time Scaling of Reasoning Models for Machine Translation
EACL 2026
SemEval-2025 Task 3: Mu-SHROOM, the Multilingual Shared-task on Hallucinations and Related Observable Overgeneration Mistakes
SEMEVAL 2025
A Cross-Lingual Perspective on Neural Machine Translation Difficulty
EMNLP 2025
Implementing and Evaluating Multi-source Retrieval-Augmented Translation
EMNLP 2025
DocHPLT: A Massively Multilingual Document-Level Translation Dataset
EMNLP 2025
GlotEval: A Test Suite for Massively Multilingual Evaluation of Large Language Models
EMNLP 2025
Scaling Low-Resource MT via Synthetic Data Generation with LLMs
EMNLP 2025
An Expanded Massive Multilingual Dataset for High-Performance Language Technologies (HPLT)
ACL 2025
SemEval-2025 Task 3: Mu-SHROOM, the Multilingual Shared-task on Hallucinations and Related Observable Overgeneration Mistakes
ACL 2025
A New Massive Multilingual Dataset for High-Performance Language Technologies
COLING 2024
Can Machine Translation Bridge Multilingual Pretraining and Cross-lingual Transfer Learning?
COLING 2024
Towards Automatic Finnish Text Simplification
COLING 2024
MAMMOTH: Massively Multilingual Modular Open Translation @ Helsinki
EACL 2024
A Comparison of Language Modeling and Translation as Multilingual Pretraining Objectives
EMNLP 2024
Hybrid Distillation from RBMT and NMT: Helsinki-NLP’s Submission to the Shared Task on Translation into Low-Resource Languages of Spain
EMNLP 2024
SemEval-2024 Task 6: SHROOM, a Shared-task on Hallucinations and Related Observable Overgeneration Mistakes
NAACL 2024
SemEval-2024 Task 6: SHROOM, a Shared-task on Hallucinations and Related Observable Overgeneration Mistakes
SEMEVAL 2024
Four Approaches to Low-Resource Multilingual NMT: The Helsinki Submission to the AmericasNLP 2023 Shared Task
ACL 2023
The OPUS-MT Dashboard – A Toolkit for a Systematic Evaluation of Open Machine Translation Models
ACL 2023
Helsinki-NLP at SemEval-2022 Task 2: A Feature-Based Approach to Multilingual Idiomaticity Detection
SEMEVAL 2022
It Is Not Easy To Detect Paraphrases: Analysing Semantic Similarity With Antonyms and Negation Using the New SemAntoNeg Benchmark
EMNLP 2022
Helsinki-NLP at SemEval-2022 Task 2: A Feature-Based Approach to Multilingual Idiomaticity Detection
NAACL 2022
When to Laugh and How Hard? A Multimodal Approach to Detecting Humor and Its Intensity
COLING 2022
How Does Data Corruption Affect Natural Language Understanding Models? A Study on GLUE datasets
NAACL 2022
A Closer Look at Parameter Contributions When Training Neural Language and Translation Models
COLING 2022
On the differences between BERT and MT encoder spaces and how to address them in translation tasks
IJCNLP 2021
An Empirical Investigation of Word Alignment Supervision for Zero-Shot Multilingual Neural Machine Translation
EMNLP 2021
On the differences between BERT and MT encoder spaces and how to address them in translation tasks
ACL 2021
Creating an Aligned Russian Text Simplification Dataset from Language Learner Data
EACL 2021
The Helsinki submission to the AmericasNLP shared task
NAACL 2021
XED: A Multilingual Dataset for Sentiment Analysis and Emotion Detection
COLING 2020
Fixed Encoder Self-Attention Patterns in Transformer-Based Machine Translation
EMNLP 2020
The Tatoeba Translation Challenge – Realistic Data Sets for Low Resource and Multilingual MT
EMNLP 2020
LT@Helsinki at SemEval-2020 Task 12: Multilingual or Language-specific BERT?
SEMEVAL 2020
The University of Helsinki Submission to the IWSLT2020 Offline SpeechTranslation Task
ACL 2020
The MUCOW word sense disambiguation test suite at WMT 2020
EMNLP 2020
Controlling the Imprint of Passivization and Negation in Contextualized Representations
EMNLP 2020
LSDC - A comprehensive dataset for Low Saxon Dialect Classification
COLING 2020
OpusFilter: A Configurable Parallel Corpus Filtering Toolbox
ACL 2020
LT@Helsinki at SemEval-2020 Task 12: Multilingual or Language-specific BERT?
COLING 2020
Measuring Semantic Abstraction of Multilingual NMT with Paraphrase Recognition and Generation Tasks
NAACL 2019
Revisiting NMT for Normalization of Early English Letters
NAACL 2019
The University of Helsinki Submission to the WMT19 Parallel Corpus Filtering Task
ACL 2019
Analysing concatenation approaches to document-level NMT in two different domains
EMNLP 2019
Proceedings of the Sixth Workshop on NLP for Similar Languages, Varieties and Dialects
NAACL 2019
The MuCoW Test Suite at WMT 2019: Automatically Harvested Multilingual Contrastive Word Sense Disambiguation Test Sets for Machine Translation
ACL 2019
The University of Helsinki Submissions to the WMT19 News Translation Task
ACL 2019
Multilingual NMT with a Language-Independent Attention Bridge
ACL 2019
An Evaluation of Language-Agnostic Inner-Attention-Based Representations in Machine Translation
ACL 2019
An Analysis of Encoder Representations in Transformer-Based Machine Translation
EMNLP 2018
The MeMAD Submission to the IWSLT 2018 Speech Translation Task
EMNLP 2018
Creating a Dataset for Multilingual Fine-grained Emotion-detection Using Gamification-based Annotation
EMNLP 2018
The University of Helsinki submissions to the WMT18 news task
EMNLP 2018
The MeMAD Submission to the WMT18 Multimodal Translation Task
EMNLP 2018
Language Identification and Morphosyntactic Tagging: The Second VarDial Evaluation Campaign
COLING 2018
Proceedings of the Fifth Workshop on NLP for Similar Languages, Varieties and Dialects (VarDial 2018)
COLING 2018
Normalizing Early English Letters to Present-day English Spelling
COLING 2018
Continuous multilinguality with language vectors
EACL 2017
Character-based Joint Segmentation and POS Tagging for Chinese using Bidirectional RNN-CRF
IJCNLP 2017
Treebank Translation for Cross-Lingual Parser Induction
CONLL 2014
Rediscovering Annotation Projection for Cross-Lingual Parser Induction
COLING 2014
Docent: A Document-Level Decoder for Phrase-Based Statistical Machine Translation
ACL 2013
Latent Anaphora Resolution for Cross-Lingual Pronoun Prediction
EMNLP 2013
Bitexts as Semantic Mirrors
EMNLP 2013
Efficient Discrimination Between Closely Related Languages
COLING 2012
Combining Word-Level and Character-Level Models for Machine Translation Between Closely-Related Languages
ACL 2012
Document-Wide Decoding for Phrase-Based Statistical Machine Translation
EMNLP 2012
Character-Based Pivot Translation for Under-Resourced Languages and Domains
EACL 2012
Document-Wide Decoding for Phrase-Based Statistical Machine Translation
CONLL 2012
LetsMT!: Cloud-Based Platform for Do-It-Yourself Machine Translation
ACL 2012
Finding Synonyms Using Automatic Word Alignment and Measures of Distributional Similarity
ACL 2006
Finding Synonyms Using Automatic Word Alignment and Measures of Distributional Similarity
COLING 2006
Integrating Linguistic Knowledge in Passage Retrieval for Question Answering
EMNLP 2005
Word to word alignment strategies
COLING 2004
Combining Clues for Word Alignment
EACL 2003