Josef van Genabith
123 papers · 2004–2026 · 10 conferences · across top CS/AI conferences
Achievements
Jump to papers ↓+14 more ↓ Show less ↑
🌍 Conference Polyglot (10) 🐣 Hot Topic Early Bird 🌉 Interdisciplinary Bridge 🧭 Keyword Pioneer 🏃 Academic Marathon (21)
🧭
Keyword Pioneer
🐣
Hot Topic Early Bird
🏃
Academic Marathon
(21)
🌟
Keyword Trendsetter Combo
(4)
🏠
Conference Loyalist
(34)
🤝
Dynamic Duo
(19)
🔬
Deep Specialist
(37)
🏆
Keyword Champion
(3)
⚡
Prolific Year
(6)
🗃️
Keyword Collector
(291)
❓
The Questioner
(2)
💎
Century Club
(116)
🔥
Unstoppable
(16)
📈
Trend Setter
Conferences
ACL (35)
EMNLP (25)
COLING (18)
NAACL (10)
IJCNLP (9)
EACL (8)
SEMEVAL (7)
AACL (5)
CONLL (5)
IJCAI (1)
Top co-authors
Keywords
neural machine translation
(24)
machine translation
(16)
low-resource language
(9)
attention mechanism
(7)
automatic post-editing
(7)
large language model
(5)
coreference resolution
(5)
unsupervised learning
(5)
sentence embedding
(4)
transformer architecture
(4)
human-computer interaction
(4)
multilingual translation
(4)
representation learning
(4)
self-supervised learning
(4)
sign language translation
(4)
cross-lingual transfer
(4)
recurrent neural network
(3)
transfer learning
(3)
multilingual nlp
(3)
multimodal learning
(3)
Papers
Why Does Reinforcement Learning Generalize? A Feature-Level Mechanistic Study of Post-Training in Large Language Models
ACL 2026
A Comprehensive Evaluation of Chain-of-Thought Faithfulness in Persian Classification Tasks
EACL 2026
When Flores Bloomz Wrong: Cross-Direction Contamination in Machine Translation Evaluation
EACL 2026
Modular Arithmetic: Language Models Solve Math Digit by Digit
AACL 2025
Multilingual Political Views of Large Language Models: Identification and Steering
AACL 2025
On Multilingual Encoder Language Model Compression for Low-Resource Languages
AACL 2025
Reverse Probing: Evaluating Knowledge Transfer via Finetuned Task Embeddings for Coreference Resolution
NAACL 2025
MultiCoPIE: A Multilingual Corpus of Potentially Idiomatic Expressions for Cross-lingual PIE Disambiguation
NAACL 2025
AutoPsyC: Automatic Recognition of Psychodynamic Conflicts from Semi-structured Interviews with Large Language Models
NAACL 2025
Continual Learning in Multilingual Sign Language Translation
NAACL 2025
Small Models, Big Impact: Efficient Corpus and Graph-Based Adaptation of Small Multilingual Language Models for Low-Resource Languages
ACL 2025
Modular Arithmetic: Language Models Solve Math Digit by Digit
IJCNLP 2025
Multilingual Political Views of Large Language Models: Identification and Steering
IJCNLP 2025
On Multilingual Encoder Language Model Compression for Low-Resource Languages
IJCNLP 2025
Language Arithmetics: Towards Systematic Language Neuron Identification and Manipulation
IJCNLP 2025
SONAR-SLT: Multilingual Sign Language Translation via Language-Agnostic Sentence Embedding Supervision
EMNLP 2025
TenseLoC: Tense Localization and Control in a Multilingual LLM
EMNLP 2025
Disentangling Mathematical Reasoning in LLMs: A Methodological Investigation of Internal Mechanisms
EMNLP 2025
The Lookahead Limitation: Why Multi-Operand Addition is Hard for LLMs
EMNLP 2025
Language Arithmetics: Towards Systematic Language Neuron Identification and Manipulation
AACL 2025
When Scale Meets Diversity: Evaluating Language Models on Fine-Grained Multilingual Claim Verification
ACL 2025
Rewiring the Transformer with Depth-Wise LSTMs
COLING 2024
Analysing Translation Artifacts: A Comparative Study of LLMs, NMTs, and Human Translations
EMNLP 2024
MMAR: Multilingual and Multimodal Anaphora Resolution in Instructional Videos
EMNLP 2024
When Your Cousin Has the Right Connections: Unsupervised Bilingual Lexicon Induction for Related Data-Imbalanced Languages
COLING 2024
Sign Language Translation with Sentence Embedding Supervision
ACL 2024
Are the Best Multilingual Document Embeddings simply Based on Sentence Embeddings?
EACL 2023
Investigating the Encoding of Words in BERT’s Neurons Using Feature Textualization
EMNLP 2023
Find-2-Find: Multitask Learning for Anaphora Resolution and Object Localization
EMNLP 2023
Translating away Translationese without Parallel Data
EMNLP 2023
Enriching Wayúunaiki-Spanish Neural Machine Translation with Linguistic Information
ACL 2023
Exploring Paracrawl for Document-level Neural Machine Translation
EACL 2023
Combining Noisy Semantic Signals with Orthographic Cues: Cognate Induction for the Indic Dialect Continuum
CONLL 2022
Spatio-temporal Sign Language Representation and Translation
EMNLP 2022
Exploiting Social Media Content for Self-Supervised Style Transfer
NAACL 2022
Chop and Change: Anaphora Resolution in Instructional Cooking Videos
AACL 2022
Combining Noisy Semantic Signals with Orthographic Cues: Cognate Induction for the Indic Dialect Continuum
EMNLP 2022
Mid-Air Hand Gestures for Post-Editing of Machine Translation
ACL 2021
Multi-Head Highly Parallelized LSTM Decoder for Neural Machine Translation
ACL 2021
A Bidirectional Transformer Based Alignment Model for Unsupervised Word Alignment
ACL 2021
Modeling Task-Aware MIMO Cardinality for Efficient Multilingual Neural Machine Translation
ACL 2021
Comparing Feature-Engineering and Feature-Learning Approaches for Multilingual Translationese Classification
EMNLP 2021
Investigating the Helpfulness of Word-Level Quality Estimation for Post-Editing Machine Translation Output
EMNLP 2021
TransIns: Document Translation with Markup Reinsertion
EMNLP 2021
Learning Hard Retrieval Decoder Attention for Transformers
EMNLP 2021
Multi-Head Highly Parallelized LSTM Decoder for Neural Machine Translation
IJCNLP 2021
A Bidirectional Transformer Based Alignment Model for Unsupervised Word Alignment
IJCNLP 2021
Mid-Air Hand Gestures for Post-Editing of Machine Translation
IJCNLP 2021
Modeling Task-Aware MIMO Cardinality for Efficient Multilingual Neural Machine Translation
IJCNLP 2021
Probing Word Translations in the Transformer and Trading Decoder for Encoder Layers
NAACL 2021
UdS-DFKI@WMT20: Unsupervised MT and Very Low Resource Supervised MT for German-Upper Sorbian
EMNLP 2020
Understanding Translationese in Multi-view Embedding Spaces
COLING 2020
The Transference Architecture for Automatic Post-Editing
COLING 2020
Efficient Context-Aware Neural Machine Translation with Layer-Wise Weighting and Input-Aware Gating
IJCAI 2020
Learning Source Phrase Representations for Neural Machine Translation
ACL 2020
Lipschitz Constrained Parameter Initialization for Deep Transformers
ACL 2020
MMPE: A Multi-Modal Interface for Post-Editing Machine Translation
ACL 2020
Dynamically Adjusting Transformer Batch Size by Monitoring Gradient Direction Change
ACL 2020
MMPE: A Multi-Modal Interface using Handwriting, Touch Reordering, and Speech Commands for Post-Editing Machine Translation
ACL 2020
How Human is Machine Translationese? Comparing Human and Machine Translations of Text and Speech
ACL 2020
Self-Induced Curriculum Learning in Self-Supervised Neural Machine Translation
EMNLP 2020
Translation Quality Estimation by Jointly Learning to Score and Rank
EMNLP 2020
USAAR-DFKI – The Transference Architecture for English–German Automatic Post-Editing
ACL 2019
UDS–DFKI Submission to the WMT2019 Czech–Polish Similar Language Translation Shared Task
ACL 2019
DFKI-NMT Submission to the WMT19 News Translation Task
ACL 2019
UdS Submission for the WMT 19 Automatic Post-Editing Task
ACL 2019
Self-Supervised Neural Machine Translation
ACL 2019
JU-Saarland Submission to the WMT2019 English–Gujarati Translation Shared Task
ACL 2019
Analysing Coreference in Transformer Outputs
EMNLP 2019
A Transformer-Based Multi-Source Automatic Post-Editing System
EMNLP 2018
Code-Mixed Question Answering Challenge: Crowd-sourcing Data and Techniques
ACL 2018
Neural Automatic Post-Editing Using Prior Alignment and Reranking
EACL 2017
An Extensive Empirical Evaluation of Character-Based Morphological Tagging for 14 Languages
EACL 2017
Common Round: Application of Language Technologies to Large-Scale Web Debates
EACL 2017
CATaLog Online: A Web-based CAT Tool for Distributed Translation with Data Capture for APE and Translation Process Research
COLING 2016
MacSaar at SemEval-2016 Task 11: Zipfian and Character Features for ComplexWord Identification
SEMEVAL 2016
USAAR at SemEval-2016 Task 13: Hyponym Endocentricity
SEMEVAL 2016
A Neural Network based Approach to Automatic Post-Editing
ACL 2016
Information Density and Quality Estimation Features as Translationese Indicators for Human Translation Classification
NAACL 2016
BIRA: Improved Predictive Exchange Word Clustering
NAACL 2016
Scaling Up Word Clustering
NAACL 2016
SAARSHEFF at SemEval-2016 Task 1: Semantic Textual Similarity with Machine Translation Evaluation Metrics and (eXtreme) Boosted Tree Ensembles
SEMEVAL 2016
WOLVESAAR at SemEval-2016 Task 1: Replicating the Success of Monolingual Word Alignment and Neural Embeddings for Semantic Textual Similarity
SEMEVAL 2016
Modeling Diachronic Change in Scientific Writing with Information Density
COLING 2016
Multi-Engine and Multi-Alignment Based Automatic Post-Editing and its Impact on Translation Productivity
COLING 2016
USAAR-WLV: Hypernym Generation with Deep Neural Nets
SEMEVAL 2015
ReVal: A Simple and Effective Machine Translation Evaluation Metric Based on Recurrent Neural Networks
EMNLP 2015
USAAR-SHEFFIELD: Semantic Textual Similarity with Deep Regression and Machine Translation Evaluation Metrics
SEMEVAL 2015
Active Learning for Post-Editing Based Incrementally Retrained MT
EACL 2014
CNGL: Grading Student Answers by Acts of Translation
SEMEVAL 2013
TMTprime: A Recommender System for MT and TM Integration
NAACL 2013
The Floating Arabic Dictionary: An Automatic Method for Updating a Lexical Database through the Detection and Lemmatization of Unknown Words
COLING 2012
Translation Quality-Based Supplementary Data Selection by Incremental Update of Translation Models
COLING 2012
An Evaluation of Statistical Post-Editing Systems Applied to RBMT and SMT Systems
COLING 2012
Simple and Effective Parameter Tuning for Domain Adaptation of Statistical Machine Translation
COLING 2012
Improved Spelling Error Detection and Correction for Arabic
COLING 2012
Identifying High-Impact Sub-Structures for Convolution Kernels in Document-level Sentiment Classification
ACL 2012
Head-Driven Hierarchical Phrase-based Translation
ACL 2012
Combining Multiple Alignments to Improve Machine Translation
COLING 2012
Consistent Translation using Discriminative Learning - A Translation Memory-inspired Approach
ACL 2011
From News to Comment: Resources and Benchmarks for Parsing the Language of Web 2.0
IJCNLP 2011
Hard Constraints for Grammatical Function Labelling
ACL 2010
Bridging SMT and TM with Translation Recommendation
ACL 2010
Integrating N-best SMT Outputs into a TM System
COLING 2010
Wide-Coverage NLP with Linguistically Expressive Grammars
ACL 2010
Adapting a WSJ-Trained Parser to Grammatically Noisy Text
ACL 2008
Dependency-Based N-Gram Models for General Purpose Sentence Realisation
COLING 2008
Exploiting Multi-Word Units in History-Based Probabilistic Generation
EMNLP 2007
Treebank Annotation Schemes and Parser Evaluation for German
CONLL 2007
Exploiting Multi-Word Units in History-Based Probabilistic Generation
CONLL 2007
A Comparative Evaluation of Deep and Shallow Approaches to the Automatic Detection of Common Grammatical Errors
EMNLP 2007
Recovering Non-Local Dependencies for Chinese
CONLL 2007
Recovering Non-Local Dependencies for Chinese
EMNLP 2007
A Comparative Evaluation of Deep and Shallow Approaches to the Automatic Detection of Common Grammatical Errors
CONLL 2007
Treebank Annotation Schemes and Parser Evaluation for German
EMNLP 2007
QuestionBank: Creating a Corpus of Parse-Annotated Questions
COLING 2006
Using Machine-Learning to Assign Function Labels to Parser Output for Spanish
COLING 2006
Using Machine-Learning to Assign Function Labels to Parser Output for Spanish
ACL 2006
Robust PCFG-Based Generation Using Automatically Acquired LFG Approximations
ACL 2006
QuestionBank: Creating a Corpus of Parse-Annotated Questions
ACL 2006
Robust PCFG-Based Generation Using Automatically Acquired LFG Approximations
COLING 2006
Large-Scale Induction and Evaluation of Lexical Resources from the Penn-II Treebank
ACL 2004
Long-Distance Dependency Resolution in Automatically Acquired Wide-Coverage PCFG-Based LFG Approximations
ACL 2004