conftrace_

Jörg Tiedemann

75 papers · 2003–2026 · 8 conferences · across top CS/AI conferences

Achievements

Jump to papers ↓

+15 more ↓

🌍 Conference Polyglot (8) 🐣 Hot Topic Early Bird 🧭 Keyword Pioneer 🌉 Interdisciplinary Bridge 🏃 Academic Marathon (22)

🧭 Keyword Pioneer 🐣 Hot Topic Early Bird 🏃 Academic Marathon (22) 🏠 Conference Loyalist (23) 🐺 Lone Wolf (6) 🤝 Dynamic Duo (19) 👥 Mega-Team (35) 🔬 Deep Specialist (29) ❓ The Questioner (5) 🚀 Conference Pioneer ⚡ Prolific Year (8) 🗃️ Keyword Collector (221) 📈 Trend Setter 🔥 Unstoppable (9) 💎 Century Club (74)

Conferences

EMNLP (23) ACL (16) COLING (15) NAACL (7) EACL (6) SEMEVAL (4) CONLL (2) IJCNLP (2)

Top co-authors

Raúl Vázquez (19) Alessandro Raganato (14) Yves Scherrer (12) Ona de Gibert (11) Mathias Creutz (9) Shaoxiong Ji (8) Timothee Mickus (7) Mikko Aulamo (7) Sami Virpioja (6) Teemu Vahtola (6)

Research topics

Linguistics (2)

Keywords

machine translation (21) neural machine translation (16) multilingual nlp (7) multilingual model (7) parallel corpus (6) transfer learning (6) low-resource language (6) multilingual translation (6) text classification (5) attention mechanism (5) hallucination detection (4) language model (4) document-level translation (3) sentiment analysis (3) benchmark evaluation (3) translation quality (3) knowledge distillation (3) paraphrase generation (3) pretrained language model (3) multilingual neural machine translation (3)

Papers

Test-Time Scaling of Reasoning Models for Machine Translation EACL 2026 SemEval-2025 Task 3: Mu-SHROOM, the Multilingual Shared-task on Hallucinations and Related Observable Overgeneration Mistakes SEMEVAL 2025 A Cross-Lingual Perspective on Neural Machine Translation Difficulty EMNLP 2025 Implementing and Evaluating Multi-source Retrieval-Augmented Translation EMNLP 2025 DocHPLT: A Massively Multilingual Document-Level Translation Dataset EMNLP 2025 GlotEval: A Test Suite for Massively Multilingual Evaluation of Large Language Models EMNLP 2025 Scaling Low-Resource MT via Synthetic Data Generation with LLMs EMNLP 2025 An Expanded Massive Multilingual Dataset for High-Performance Language Technologies (HPLT) ACL 2025 SemEval-2025 Task 3: Mu-SHROOM, the Multilingual Shared-task on Hallucinations and Related Observable Overgeneration Mistakes ACL 2025 A New Massive Multilingual Dataset for High-Performance Language Technologies COLING 2024 Can Machine Translation Bridge Multilingual Pretraining and Cross-lingual Transfer Learning? COLING 2024 Towards Automatic Finnish Text Simplification COLING 2024 MAMMOTH: Massively Multilingual Modular Open Translation @ Helsinki EACL 2024 A Comparison of Language Modeling and Translation as Multilingual Pretraining Objectives EMNLP 2024 Hybrid Distillation from RBMT and NMT: Helsinki-NLP’s Submission to the Shared Task on Translation into Low-Resource Languages of Spain EMNLP 2024 SemEval-2024 Task 6: SHROOM, a Shared-task on Hallucinations and Related Observable Overgeneration Mistakes NAACL 2024 SemEval-2024 Task 6: SHROOM, a Shared-task on Hallucinations and Related Observable Overgeneration Mistakes SEMEVAL 2024 Four Approaches to Low-Resource Multilingual NMT: The Helsinki Submission to the AmericasNLP 2023 Shared Task ACL 2023 The OPUS-MT Dashboard – A Toolkit for a Systematic Evaluation of Open Machine Translation Models ACL 2023 Helsinki-NLP at SemEval-2022 Task 2: A Feature-Based Approach to Multilingual Idiomaticity Detection SEMEVAL 2022 It Is Not Easy To Detect Paraphrases: Analysing Semantic Similarity With Antonyms and Negation Using the New SemAntoNeg Benchmark EMNLP 2022 Helsinki-NLP at SemEval-2022 Task 2: A Feature-Based Approach to Multilingual Idiomaticity Detection NAACL 2022 When to Laugh and How Hard? A Multimodal Approach to Detecting Humor and Its Intensity COLING 2022 How Does Data Corruption Affect Natural Language Understanding Models? A Study on GLUE datasets NAACL 2022 A Closer Look at Parameter Contributions When Training Neural Language and Translation Models COLING 2022 On the differences between BERT and MT encoder spaces and how to address them in translation tasks IJCNLP 2021 An Empirical Investigation of Word Alignment Supervision for Zero-Shot Multilingual Neural Machine Translation EMNLP 2021 On the differences between BERT and MT encoder spaces and how to address them in translation tasks ACL 2021 Creating an Aligned Russian Text Simplification Dataset from Language Learner Data EACL 2021 The Helsinki submission to the AmericasNLP shared task NAACL 2021 XED: A Multilingual Dataset for Sentiment Analysis and Emotion Detection COLING 2020 Fixed Encoder Self-Attention Patterns in Transformer-Based Machine Translation EMNLP 2020 The Tatoeba Translation Challenge – Realistic Data Sets for Low Resource and Multilingual MT EMNLP 2020 LT@Helsinki at SemEval-2020 Task 12: Multilingual or Language-specific BERT? SEMEVAL 2020 The University of Helsinki Submission to the IWSLT2020 Offline SpeechTranslation Task ACL 2020 The MUCOW word sense disambiguation test suite at WMT 2020 EMNLP 2020 Controlling the Imprint of Passivization and Negation in Contextualized Representations EMNLP 2020 LSDC - A comprehensive dataset for Low Saxon Dialect Classification COLING 2020 OpusFilter: A Configurable Parallel Corpus Filtering Toolbox ACL 2020 LT@Helsinki at SemEval-2020 Task 12: Multilingual or Language-specific BERT? COLING 2020 Measuring Semantic Abstraction of Multilingual NMT with Paraphrase Recognition and Generation Tasks NAACL 2019 Revisiting NMT for Normalization of Early English Letters NAACL 2019 The University of Helsinki Submission to the WMT19 Parallel Corpus Filtering Task ACL 2019 Analysing concatenation approaches to document-level NMT in two different domains EMNLP 2019 Proceedings of the Sixth Workshop on NLP for Similar Languages, Varieties and Dialects NAACL 2019 The MuCoW Test Suite at WMT 2019: Automatically Harvested Multilingual Contrastive Word Sense Disambiguation Test Sets for Machine Translation ACL 2019 The University of Helsinki Submissions to the WMT19 News Translation Task ACL 2019 Multilingual NMT with a Language-Independent Attention Bridge ACL 2019 An Evaluation of Language-Agnostic Inner-Attention-Based Representations in Machine Translation ACL 2019 An Analysis of Encoder Representations in Transformer-Based Machine Translation EMNLP 2018 The MeMAD Submission to the IWSLT 2018 Speech Translation Task EMNLP 2018 Creating a Dataset for Multilingual Fine-grained Emotion-detection Using Gamification-based Annotation EMNLP 2018 The University of Helsinki submissions to the WMT18 news task EMNLP 2018 The MeMAD Submission to the WMT18 Multimodal Translation Task EMNLP 2018 Language Identification and Morphosyntactic Tagging: The Second VarDial Evaluation Campaign COLING 2018 Proceedings of the Fifth Workshop on NLP for Similar Languages, Varieties and Dialects (VarDial 2018) COLING 2018 Normalizing Early English Letters to Present-day English Spelling COLING 2018 Continuous multilinguality with language vectors EACL 2017 Character-based Joint Segmentation and POS Tagging for Chinese using Bidirectional RNN-CRF IJCNLP 2017 Treebank Translation for Cross-Lingual Parser Induction CONLL 2014 Rediscovering Annotation Projection for Cross-Lingual Parser Induction COLING 2014 Docent: A Document-Level Decoder for Phrase-Based Statistical Machine Translation ACL 2013 Latent Anaphora Resolution for Cross-Lingual Pronoun Prediction EMNLP 2013 Bitexts as Semantic Mirrors EMNLP 2013 Efficient Discrimination Between Closely Related Languages COLING 2012 Combining Word-Level and Character-Level Models for Machine Translation Between Closely-Related Languages ACL 2012 Document-Wide Decoding for Phrase-Based Statistical Machine Translation EMNLP 2012 Character-Based Pivot Translation for Under-Resourced Languages and Domains EACL 2012 Document-Wide Decoding for Phrase-Based Statistical Machine Translation CONLL 2012 LetsMT!: Cloud-Based Platform for Do-It-Yourself Machine Translation ACL 2012 Finding Synonyms Using Automatic Word Alignment and Measures of Distributional Similarity ACL 2006 Finding Synonyms Using Automatic Word Alignment and Measures of Distributional Similarity COLING 2006 Integrating Linguistic Knowledge in Passage Retrieval for Question Answering EMNLP 2005 Word to word alignment strategies COLING 2004 Combining Clues for Word Alignment EACL 2003