Rob van der Goot

64 papers · 2014–2026 · 7 conferences · across top CS/AI conferences

Achievements

+12 more ↓

🏃 Academic Marathon (11) 🌍 Conference Polyglot (7) 🌉 Interdisciplinary Bridge 🧭 Keyword Pioneer 🐝 Cross-Pollinator (14)

🌈 Renaissance Researcher (8) 🌍 Conference Polyglot (7) 🏃 Academic Marathon (11) 🐺 Lone Wolf (10) 🤝 Dynamic Duo (29) 🔬 Deep Specialist (16) 🏆 Keyword Champion (2) 🗃️ Keyword Collector (247) ⚡ Prolific Year (10) ❓ The Questioner (5) 🔥 Unstoppable (9) 💎 Century Club (63)

Conferences

EMNLP (16) ACL (12) COLING (12) EACL (11) NAACL (7) SEMEVAL (5) CONLL (1)

Top co-authors

Barbara Plank (29) Max Müller-Eberstein (13) Mike Zhang (8) Elisa Bassignana (6) Alan Ramponi (4) Gertjan van Noord (4) Ahmet Üstün (4) Tanja Samardžić (3) Nikola Ljubešić (3) Elena Senger (3)

Research topics

Applications (1)

Keywords

transfer learning (12) dependency parsing (11) text classification (10) multi-task learning (10) language model (9) lexical normalization (7) multilingual nlp (7) cross-lingual transfer (7) sequence labeling (5) zero-shot learning (5) named entity recognition (4) text normalization (4) syntactic parsing (4) social media text (3) low-resource language (3) contextualized embedding (3) domain adaptation (3) language identification (3) model evaluation (3) natural language processing (3)

Papers

CommonLID: Re-evaluating State-of-the-Art Language Identification Performance on Web Data ACL 2026 Bias in Danish Medical Notes: Infection Classification of Long Texts Using Transformer and LSTM Architectures Coupled with BERT NAACL 2025 DaKultur: Evaluating the Cultural Awareness of Language Models for Danish with Native Speakers NAACL 2025 Do Syntactic Categories Help in Developmentally Motivated Curriculum Learning for Language Models? EMNLP 2025 Identifying Open Challenges in Language Identification ACL 2025 data2lang2vec: Data Driven Typological Features Completion COLING 2025 Iterative Structured Knowledge Distillation: Optimizing Language Models Through Layer-by-Layer Distillation COLING 2025 KARRIEREWEGE: A large scale Career Path Prediction Dataset COLING 2025 How to age BERT Well: Continuous Training for Historical Language Adaptation COLING 2025 Findings of the VarDial Evaluation Campaign 2025: The NorSID Shared Task on Norwegian Slot, Intent and Dialect Identification COLING 2025 DECAF: A Dynamically Extensible Corpus Analysis Framework ACL 2025 Crossing Domains without Labels: Distant Supervision for Term Extraction EMNLP 2025 DistaLs: a Comprehensive Collection of Language Distance Measures EMNLP 2025 Deep Learning-based Computational Job Market Analysis: A Survey on Skill Extraction and Classification from Job Postings EACL 2024 Entity Linking in the Job Market Domain EACL 2024 Where are we Still Split on Tokenization? EACL 2024 NNOSE: Nearest Neighbor Occupational Skill Extraction EACL 2024 What’s wrong with your model? A Quantitative Analysis of Relation Classification NAACL 2024 Big City Bias: Evaluating the Impact of Metropolitan Size on Computational Job Market Abilities of Language Models EACL 2024 Can Humans Identify Domains? COLING 2024 Enough Is Enough! a Case Study on the Effect of Data Size for Evaluation Using Universal Dependencies COLING 2024 How to Encode Domain Information in Relation Classification COLING 2024 Slot and Intent Detection Resources for Bavarian and Lithuanian: Assessing Translations vs Natural Queries to Digital Assistants COLING 2024 EEVEE: An Easy Annotation Tool for Natural Language Processing EACL 2024 MaChAmp at SemEval-2023 tasks 2, 3, 4, 5, 7, 8, 9, 10, 11, and 12: On the Effectiveness of Intermediate Training on an Uncurated Collection of Datasets. ACL 2023 ESCOXLM-R: Multilingual Taxonomy-driven Pre-training for the Job Market Domain ACL 2023 Native Language Prediction from Gaze: a Reproducibility Study ACL 2023 Silver Syntax Pre-training for Cross-Domain Relation Extraction ACL 2023 Findings of the VarDial Evaluation Campaign 2023 EACL 2023 Establishing Trustworthiness: Rethinking Tasks and Model Evaluation EMNLP 2023 Subspace Chronicles: How Linguistic Information Emerges, Shifts and Interacts during Language Model Training EMNLP 2023 MaChAmp at SemEval-2023 tasks 2, 3, 4, 5, 7, 8, 9, 10, 11, and 12: On the Effectiveness of Intermediate Training on an Uncurated Collection of Datasets. SEMEVAL 2023 MaChAmp at SemEval-2022 Tasks 2, 3, 4, 6, 10, 11, and 12: Multi-task Multi-lingual Learning for a Pre-selected Set of Semantic Datasets NAACL 2022 Spectral Probing EMNLP 2022 Experimental Standards for Deep Learning in Natural Language Processing Research EMNLP 2022 On Language Spaces, Scales and Cross-Lingual Transfer of UD Parsers EMNLP 2022 MaChAmp at SemEval-2022 Tasks 2, 3, 4, 6, 10, 11, and 12: Multi-task Multi-lingual Learning for a Pre-selected Set of Semantic Datasets SEMEVAL 2022 Sort by Structure: Language Model Ranking as Dependency Probing NAACL 2022 Tafsir Dataset: A Novel Multi-Task Benchmark for Named Entity Recognition and Topic Modeling in Classical Arabic Literature COLING 2022 On Language Spaces, Scales and Cross-Lingual Transfer of UD Parsers CONLL 2022 Increasing Robustness for Cross-domain Dialogue Act Classification on Social Media Data COLING 2022 Probing for Labeled Dependency Trees ACL 2022 Massive Choice, Ample Tasks (MaChAmp): A Toolkit for Multi-task Learning in NLP EACL 2021 We Need to Talk About train-dev-test Splits EMNLP 2021 Genre as Weak Supervision for Cross-lingual Dependency Parsing EMNLP 2021 MultiLexNorm: A Shared Task on Multilingual Lexical Normalization EMNLP 2021 CL-MoNoise: Cross-lingual Lexical Normalization EMNLP 2021 From Masked Language Modeling to Translation: Non-English Auxiliary Tasks Improve Zero-shot Spoken Language Understanding NAACL 2021 Much Gracias: Semi-supervised Code-switch Detection for Spanish-English: How far can we get? NAACL 2021 On the Effectiveness of Dataset Embeddings in Mono-lingual,Multi-lingual and Zero-shot Conditions EACL 2021 Challenges in Annotating and Parsing Spoken, Code-switched, Frisian-Dutch Data EACL 2021 Lexical Normalization for Code-switched Data and its Effect on POS Tagging EACL 2021 Biomedical Event Extraction as Sequence Labeling EMNLP 2020 NLP North at WNUT-2020 Task 2: Pre-training versus Ensembling for Detection of Informative COVID-19 English Tweets EMNLP 2020 DaN+: Danish Nested Named Entities and Lexical Normalization COLING 2020 sthruggle at SemEval-2019 Task 5: An Ensemble Approach to Hate Speech Detection SEMEVAL 2019 Multi-Team: A Multi-attention, Multi-decoder Approach to Morphological Analysis. ACL 2019 An In-depth Analysis of the Effect of Lexical Normalization on the Dependency Parsing of Social Media EMNLP 2019 MoNoise: A Multi-lingual and Easy-to-use Lexical Normalization Tool ACL 2019 Bleaching Text: Abstract Features for Cross-lingual Gender Prediction ACL 2018 Modeling Input Uncertainty in Neural Network Dependency Parsing EMNLP 2018 Parser Adaptation for Social Media by Integrating Normalization ACL 2017 ROB: Using Semantic Meaning to Recognize Paraphrases SEMEVAL 2015 The Meaning Factory: Formal Semantics for Recognizing Textual Entailment and Determining Semantic Similarity SEMEVAL 2014