conftrace_

Josef van Genabith

123 papers · 2004–2026 · 10 conferences · across top CS/AI conferences

Achievements

Jump to papers ↓

+14 more ↓

🌍 Conference Polyglot (10) 🐣 Hot Topic Early Bird 🌉 Interdisciplinary Bridge 🧭 Keyword Pioneer 🏃 Academic Marathon (21)

🧭 Keyword Pioneer 🐣 Hot Topic Early Bird 🏃 Academic Marathon (21) 🌟 Keyword Trendsetter Combo (4) 🏠 Conference Loyalist (34) 🤝 Dynamic Duo (19) 🔬 Deep Specialist (37) 🏆 Keyword Champion (3) ⚡ Prolific Year (6) 🗃️ Keyword Collector (291) ❓ The Questioner (2) 💎 Century Club (116) 🔥 Unstoppable (16) 📈 Trend Setter

Conferences

ACL (35) EMNLP (25) COLING (18) NAACL (10) IJCNLP (9) EACL (8) SEMEVAL (7) AACL (5) CONLL (5) IJCAI (1)

Top co-authors

Cristina España-Bonet (20) Simon Ostermann (17) Hongfei Xu (14) Deyi Xiong (12) Qiuhui Liu (12) Santanu Pal (11) Daniil Gurgurov (9) Aoife Cahill (8) Antonio Krüger (8) Liling Tan (8)

Keywords

neural machine translation (24) machine translation (16) low-resource language (9) attention mechanism (7) automatic post-editing (7) large language model (5) coreference resolution (5) unsupervised learning (5) sentence embedding (4) transformer architecture (4) human-computer interaction (4) multilingual translation (4) representation learning (4) self-supervised learning (4) sign language translation (4) cross-lingual transfer (4) recurrent neural network (3) transfer learning (3) multilingual nlp (3) multimodal learning (3)

Papers

Why Does Reinforcement Learning Generalize? A Feature-Level Mechanistic Study of Post-Training in Large Language Models ACL 2026 A Comprehensive Evaluation of Chain-of-Thought Faithfulness in Persian Classification Tasks EACL 2026 When Flores Bloomz Wrong: Cross-Direction Contamination in Machine Translation Evaluation EACL 2026 Modular Arithmetic: Language Models Solve Math Digit by Digit AACL 2025 Multilingual Political Views of Large Language Models: Identification and Steering AACL 2025 On Multilingual Encoder Language Model Compression for Low-Resource Languages AACL 2025 Reverse Probing: Evaluating Knowledge Transfer via Finetuned Task Embeddings for Coreference Resolution NAACL 2025 MultiCoPIE: A Multilingual Corpus of Potentially Idiomatic Expressions for Cross-lingual PIE Disambiguation NAACL 2025 AutoPsyC: Automatic Recognition of Psychodynamic Conflicts from Semi-structured Interviews with Large Language Models NAACL 2025 Continual Learning in Multilingual Sign Language Translation NAACL 2025 Small Models, Big Impact: Efficient Corpus and Graph-Based Adaptation of Small Multilingual Language Models for Low-Resource Languages ACL 2025 Modular Arithmetic: Language Models Solve Math Digit by Digit IJCNLP 2025 Multilingual Political Views of Large Language Models: Identification and Steering IJCNLP 2025 On Multilingual Encoder Language Model Compression for Low-Resource Languages IJCNLP 2025 Language Arithmetics: Towards Systematic Language Neuron Identification and Manipulation IJCNLP 2025 SONAR-SLT: Multilingual Sign Language Translation via Language-Agnostic Sentence Embedding Supervision EMNLP 2025 TenseLoC: Tense Localization and Control in a Multilingual LLM EMNLP 2025 Disentangling Mathematical Reasoning in LLMs: A Methodological Investigation of Internal Mechanisms EMNLP 2025 The Lookahead Limitation: Why Multi-Operand Addition is Hard for LLMs EMNLP 2025 Language Arithmetics: Towards Systematic Language Neuron Identification and Manipulation AACL 2025 When Scale Meets Diversity: Evaluating Language Models on Fine-Grained Multilingual Claim Verification ACL 2025 Rewiring the Transformer with Depth-Wise LSTMs COLING 2024 Analysing Translation Artifacts: A Comparative Study of LLMs, NMTs, and Human Translations EMNLP 2024 MMAR: Multilingual and Multimodal Anaphora Resolution in Instructional Videos EMNLP 2024 When Your Cousin Has the Right Connections: Unsupervised Bilingual Lexicon Induction for Related Data-Imbalanced Languages COLING 2024 Sign Language Translation with Sentence Embedding Supervision ACL 2024 Are the Best Multilingual Document Embeddings simply Based on Sentence Embeddings? EACL 2023 Investigating the Encoding of Words in BERT’s Neurons Using Feature Textualization EMNLP 2023 Find-2-Find: Multitask Learning for Anaphora Resolution and Object Localization EMNLP 2023 Translating away Translationese without Parallel Data EMNLP 2023 Enriching Wayúunaiki-Spanish Neural Machine Translation with Linguistic Information ACL 2023 Exploring Paracrawl for Document-level Neural Machine Translation EACL 2023 Combining Noisy Semantic Signals with Orthographic Cues: Cognate Induction for the Indic Dialect Continuum CONLL 2022 Spatio-temporal Sign Language Representation and Translation EMNLP 2022 Exploiting Social Media Content for Self-Supervised Style Transfer NAACL 2022 Chop and Change: Anaphora Resolution in Instructional Cooking Videos AACL 2022 Combining Noisy Semantic Signals with Orthographic Cues: Cognate Induction for the Indic Dialect Continuum EMNLP 2022 Mid-Air Hand Gestures for Post-Editing of Machine Translation ACL 2021 Multi-Head Highly Parallelized LSTM Decoder for Neural Machine Translation ACL 2021 A Bidirectional Transformer Based Alignment Model for Unsupervised Word Alignment ACL 2021 Modeling Task-Aware MIMO Cardinality for Efficient Multilingual Neural Machine Translation ACL 2021 Comparing Feature-Engineering and Feature-Learning Approaches for Multilingual Translationese Classification EMNLP 2021 Investigating the Helpfulness of Word-Level Quality Estimation for Post-Editing Machine Translation Output EMNLP 2021 TransIns: Document Translation with Markup Reinsertion EMNLP 2021 Learning Hard Retrieval Decoder Attention for Transformers EMNLP 2021 Multi-Head Highly Parallelized LSTM Decoder for Neural Machine Translation IJCNLP 2021 A Bidirectional Transformer Based Alignment Model for Unsupervised Word Alignment IJCNLP 2021 Mid-Air Hand Gestures for Post-Editing of Machine Translation IJCNLP 2021 Modeling Task-Aware MIMO Cardinality for Efficient Multilingual Neural Machine Translation IJCNLP 2021 Probing Word Translations in the Transformer and Trading Decoder for Encoder Layers NAACL 2021 UdS-DFKI@WMT20: Unsupervised MT and Very Low Resource Supervised MT for German-Upper Sorbian EMNLP 2020 Understanding Translationese in Multi-view Embedding Spaces COLING 2020 The Transference Architecture for Automatic Post-Editing COLING 2020 Efficient Context-Aware Neural Machine Translation with Layer-Wise Weighting and Input-Aware Gating IJCAI 2020 Learning Source Phrase Representations for Neural Machine Translation ACL 2020 Lipschitz Constrained Parameter Initialization for Deep Transformers ACL 2020 MMPE: A Multi-Modal Interface for Post-Editing Machine Translation ACL 2020 Dynamically Adjusting Transformer Batch Size by Monitoring Gradient Direction Change ACL 2020 MMPE: A Multi-Modal Interface using Handwriting, Touch Reordering, and Speech Commands for Post-Editing Machine Translation ACL 2020 How Human is Machine Translationese? Comparing Human and Machine Translations of Text and Speech ACL 2020 Self-Induced Curriculum Learning in Self-Supervised Neural Machine Translation EMNLP 2020 Translation Quality Estimation by Jointly Learning to Score and Rank EMNLP 2020 USAAR-DFKI – The Transference Architecture for English–German Automatic Post-Editing ACL 2019 UDS–DFKI Submission to the WMT2019 Czech–Polish Similar Language Translation Shared Task ACL 2019 DFKI-NMT Submission to the WMT19 News Translation Task ACL 2019 UdS Submission for the WMT 19 Automatic Post-Editing Task ACL 2019 Self-Supervised Neural Machine Translation ACL 2019 JU-Saarland Submission to the WMT2019 English–Gujarati Translation Shared Task ACL 2019 Analysing Coreference in Transformer Outputs EMNLP 2019 A Transformer-Based Multi-Source Automatic Post-Editing System EMNLP 2018 Code-Mixed Question Answering Challenge: Crowd-sourcing Data and Techniques ACL 2018 Neural Automatic Post-Editing Using Prior Alignment and Reranking EACL 2017 An Extensive Empirical Evaluation of Character-Based Morphological Tagging for 14 Languages EACL 2017 Common Round: Application of Language Technologies to Large-Scale Web Debates EACL 2017 CATaLog Online: A Web-based CAT Tool for Distributed Translation with Data Capture for APE and Translation Process Research COLING 2016 MacSaar at SemEval-2016 Task 11: Zipfian and Character Features for ComplexWord Identification SEMEVAL 2016 USAAR at SemEval-2016 Task 13: Hyponym Endocentricity SEMEVAL 2016 A Neural Network based Approach to Automatic Post-Editing ACL 2016 Information Density and Quality Estimation Features as Translationese Indicators for Human Translation Classification NAACL 2016 BIRA: Improved Predictive Exchange Word Clustering NAACL 2016 Scaling Up Word Clustering NAACL 2016 SAARSHEFF at SemEval-2016 Task 1: Semantic Textual Similarity with Machine Translation Evaluation Metrics and (eXtreme) Boosted Tree Ensembles SEMEVAL 2016 WOLVESAAR at SemEval-2016 Task 1: Replicating the Success of Monolingual Word Alignment and Neural Embeddings for Semantic Textual Similarity SEMEVAL 2016 Modeling Diachronic Change in Scientific Writing with Information Density COLING 2016 Multi-Engine and Multi-Alignment Based Automatic Post-Editing and its Impact on Translation Productivity COLING 2016 USAAR-WLV: Hypernym Generation with Deep Neural Nets SEMEVAL 2015 ReVal: A Simple and Effective Machine Translation Evaluation Metric Based on Recurrent Neural Networks EMNLP 2015 USAAR-SHEFFIELD: Semantic Textual Similarity with Deep Regression and Machine Translation Evaluation Metrics SEMEVAL 2015 Active Learning for Post-Editing Based Incrementally Retrained MT EACL 2014 CNGL: Grading Student Answers by Acts of Translation SEMEVAL 2013 TMTprime: A Recommender System for MT and TM Integration NAACL 2013 The Floating Arabic Dictionary: An Automatic Method for Updating a Lexical Database through the Detection and Lemmatization of Unknown Words COLING 2012 Translation Quality-Based Supplementary Data Selection by Incremental Update of Translation Models COLING 2012 An Evaluation of Statistical Post-Editing Systems Applied to RBMT and SMT Systems COLING 2012 Simple and Effective Parameter Tuning for Domain Adaptation of Statistical Machine Translation COLING 2012 Improved Spelling Error Detection and Correction for Arabic COLING 2012 Identifying High-Impact Sub-Structures for Convolution Kernels in Document-level Sentiment Classification ACL 2012 Head-Driven Hierarchical Phrase-based Translation ACL 2012 Combining Multiple Alignments to Improve Machine Translation COLING 2012 Consistent Translation using Discriminative Learning - A Translation Memory-inspired Approach ACL 2011 From News to Comment: Resources and Benchmarks for Parsing the Language of Web 2.0 IJCNLP 2011 Hard Constraints for Grammatical Function Labelling ACL 2010 Bridging SMT and TM with Translation Recommendation ACL 2010 Integrating N-best SMT Outputs into a TM System COLING 2010 Wide-Coverage NLP with Linguistically Expressive Grammars ACL 2010 Adapting a WSJ-Trained Parser to Grammatically Noisy Text ACL 2008 Dependency-Based N-Gram Models for General Purpose Sentence Realisation COLING 2008 Exploiting Multi-Word Units in History-Based Probabilistic Generation EMNLP 2007 Treebank Annotation Schemes and Parser Evaluation for German CONLL 2007 Exploiting Multi-Word Units in History-Based Probabilistic Generation CONLL 2007 A Comparative Evaluation of Deep and Shallow Approaches to the Automatic Detection of Common Grammatical Errors EMNLP 2007 Recovering Non-Local Dependencies for Chinese CONLL 2007 Recovering Non-Local Dependencies for Chinese EMNLP 2007 A Comparative Evaluation of Deep and Shallow Approaches to the Automatic Detection of Common Grammatical Errors CONLL 2007 Treebank Annotation Schemes and Parser Evaluation for German EMNLP 2007 QuestionBank: Creating a Corpus of Parse-Annotated Questions COLING 2006 Using Machine-Learning to Assign Function Labels to Parser Output for Spanish COLING 2006 Using Machine-Learning to Assign Function Labels to Parser Output for Spanish ACL 2006 Robust PCFG-Based Generation Using Automatically Acquired LFG Approximations ACL 2006 QuestionBank: Creating a Corpus of Parse-Annotated Questions ACL 2006 Robust PCFG-Based Generation Using Automatically Acquired LFG Approximations COLING 2006 Large-Scale Induction and Evaluation of Lexical Resources from the Penn-II Treebank ACL 2004 Long-Distance Dependency Resolution in Automatically Acquired Wide-Coverage PCFG-Based LFG Approximations ACL 2004