Rob van der Goot
64 papers · 2014–2026 · 7 conferences · across top CS/AI conferences
Achievements
Jump to papers ↓+12 more ↓ Show less ↑
🏃 Academic Marathon (11) 🌍 Conference Polyglot (7) 🌉 Interdisciplinary Bridge 🧭 Keyword Pioneer 🐝 Cross-Pollinator (14)
🌈
Renaissance Researcher
(8)
🌍
Conference Polyglot
(7)
🏃
Academic Marathon
(11)
🐺
Lone Wolf
(10)
🤝
Dynamic Duo
(29)
🔬
Deep Specialist
(16)
🏆
Keyword Champion
(2)
🗃️
Keyword Collector
(247)
⚡
Prolific Year
(10)
❓
The Questioner
(5)
🔥
Unstoppable
(9)
💎
Century Club
(63)
Conferences
EMNLP (16)
ACL (12)
COLING (12)
EACL (11)
NAACL (7)
SEMEVAL (5)
CONLL (1)
Top co-authors
Research topics
Keywords
transfer learning
(12)
dependency parsing
(11)
text classification
(10)
multi-task learning
(10)
language model
(9)
lexical normalization
(7)
multilingual nlp
(7)
cross-lingual transfer
(7)
sequence labeling
(5)
zero-shot learning
(5)
named entity recognition
(4)
text normalization
(4)
syntactic parsing
(4)
social media text
(3)
low-resource language
(3)
contextualized embedding
(3)
domain adaptation
(3)
language identification
(3)
model evaluation
(3)
natural language processing
(3)
Papers
CommonLID: Re-evaluating State-of-the-Art Language Identification Performance on Web Data
ACL 2026
Bias in Danish Medical Notes: Infection Classification of Long Texts Using Transformer and LSTM Architectures Coupled with BERT
NAACL 2025
DaKultur: Evaluating the Cultural Awareness of Language Models for Danish with Native Speakers
NAACL 2025
Do Syntactic Categories Help in Developmentally Motivated Curriculum Learning for Language Models?
EMNLP 2025
Identifying Open Challenges in Language Identification
ACL 2025
data2lang2vec: Data Driven Typological Features Completion
COLING 2025
Iterative Structured Knowledge Distillation: Optimizing Language Models Through Layer-by-Layer Distillation
COLING 2025
KARRIEREWEGE: A large scale Career Path Prediction Dataset
COLING 2025
How to age BERT Well: Continuous Training for Historical Language Adaptation
COLING 2025
Findings of the VarDial Evaluation Campaign 2025: The NorSID Shared Task on Norwegian Slot, Intent and Dialect Identification
COLING 2025
DECAF: A Dynamically Extensible Corpus Analysis Framework
ACL 2025
Crossing Domains without Labels: Distant Supervision for Term Extraction
EMNLP 2025
DistaLs: a Comprehensive Collection of Language Distance Measures
EMNLP 2025
Deep Learning-based Computational Job Market Analysis: A Survey on Skill Extraction and Classification from Job Postings
EACL 2024
Entity Linking in the Job Market Domain
EACL 2024
Where are we Still Split on Tokenization?
EACL 2024
NNOSE: Nearest Neighbor Occupational Skill Extraction
EACL 2024
What’s wrong with your model? A Quantitative Analysis of Relation Classification
NAACL 2024
Big City Bias: Evaluating the Impact of Metropolitan Size on Computational Job Market Abilities of Language Models
EACL 2024
Can Humans Identify Domains?
COLING 2024
Enough Is Enough! a Case Study on the Effect of Data Size for Evaluation Using Universal Dependencies
COLING 2024
How to Encode Domain Information in Relation Classification
COLING 2024
Slot and Intent Detection Resources for Bavarian and Lithuanian: Assessing Translations vs Natural Queries to Digital Assistants
COLING 2024
EEVEE: An Easy Annotation Tool for Natural Language Processing
EACL 2024
MaChAmp at SemEval-2023 tasks 2, 3, 4, 5, 7, 8, 9, 10, 11, and 12: On the Effectiveness of Intermediate Training on an Uncurated Collection of Datasets.
ACL 2023
ESCOXLM-R: Multilingual Taxonomy-driven Pre-training for the Job Market Domain
ACL 2023
Native Language Prediction from Gaze: a Reproducibility Study
ACL 2023
Silver Syntax Pre-training for Cross-Domain Relation Extraction
ACL 2023
Findings of the VarDial Evaluation Campaign 2023
EACL 2023
Establishing Trustworthiness: Rethinking Tasks and Model Evaluation
EMNLP 2023
Subspace Chronicles: How Linguistic Information Emerges, Shifts and Interacts during Language Model Training
EMNLP 2023
MaChAmp at SemEval-2023 tasks 2, 3, 4, 5, 7, 8, 9, 10, 11, and 12: On the Effectiveness of Intermediate Training on an Uncurated Collection of Datasets.
SEMEVAL 2023
MaChAmp at SemEval-2022 Tasks 2, 3, 4, 6, 10, 11, and 12: Multi-task Multi-lingual Learning for a Pre-selected Set of Semantic Datasets
NAACL 2022
Spectral Probing
EMNLP 2022
Experimental Standards for Deep Learning in Natural Language Processing Research
EMNLP 2022
On Language Spaces, Scales and Cross-Lingual Transfer of UD Parsers
EMNLP 2022
MaChAmp at SemEval-2022 Tasks 2, 3, 4, 6, 10, 11, and 12: Multi-task Multi-lingual Learning for a Pre-selected Set of Semantic Datasets
SEMEVAL 2022
Sort by Structure: Language Model Ranking as Dependency Probing
NAACL 2022
Tafsir Dataset: A Novel Multi-Task Benchmark for Named Entity Recognition and Topic Modeling in Classical Arabic Literature
COLING 2022
On Language Spaces, Scales and Cross-Lingual Transfer of UD Parsers
CONLL 2022
Increasing Robustness for Cross-domain Dialogue Act Classification on Social Media Data
COLING 2022
Probing for Labeled Dependency Trees
ACL 2022
Massive Choice, Ample Tasks (MaChAmp): A Toolkit for Multi-task Learning in NLP
EACL 2021
We Need to Talk About train-dev-test Splits
EMNLP 2021
Genre as Weak Supervision for Cross-lingual Dependency Parsing
EMNLP 2021
MultiLexNorm: A Shared Task on Multilingual Lexical Normalization
EMNLP 2021
CL-MoNoise: Cross-lingual Lexical Normalization
EMNLP 2021
From Masked Language Modeling to Translation: Non-English Auxiliary Tasks Improve Zero-shot Spoken Language Understanding
NAACL 2021
Much Gracias: Semi-supervised Code-switch Detection for Spanish-English: How far can we get?
NAACL 2021
On the Effectiveness of Dataset Embeddings in Mono-lingual,Multi-lingual and Zero-shot Conditions
EACL 2021
Challenges in Annotating and Parsing Spoken, Code-switched, Frisian-Dutch Data
EACL 2021
Lexical Normalization for Code-switched Data and its Effect on POS Tagging
EACL 2021
Biomedical Event Extraction as Sequence Labeling
EMNLP 2020
NLP North at WNUT-2020 Task 2: Pre-training versus Ensembling for Detection of Informative COVID-19 English Tweets
EMNLP 2020
DaN+: Danish Nested Named Entities and Lexical Normalization
COLING 2020
sthruggle at SemEval-2019 Task 5: An Ensemble Approach to Hate Speech Detection
SEMEVAL 2019
Multi-Team: A Multi-attention, Multi-decoder Approach to Morphological Analysis.
ACL 2019
An In-depth Analysis of the Effect of Lexical Normalization on the Dependency Parsing of Social Media
EMNLP 2019
MoNoise: A Multi-lingual and Easy-to-use Lexical Normalization Tool
ACL 2019
Bleaching Text: Abstract Features for Cross-lingual Gender Prediction
ACL 2018
Modeling Input Uncertainty in Neural Network Dependency Parsing
EMNLP 2018
Parser Adaptation for Social Media by Integrating Normalization
ACL 2017
ROB: Using Semantic Meaning to Recognize Paraphrases
SEMEVAL 2015
The Meaning Factory: Formal Semantics for Recognizing Textual Entailment and Determining Semantic Similarity
SEMEVAL 2014