Barbara Plank
173 papers · 2009–2026 · 13 top CS/AI conferences
Achievements
🧭 Keyword Pioneer 🗺️ Taxonomy Completionist (13) 🌈 Renaissance Researcher (6) 🌉 Interdisciplinary Bridge 🐣 Hot Topic Early Bird
🗺️ Taxonomy Completionist (13)
🧭 Keyword Pioneer
🐣 Hot Topic Early Bird
🌟 Keyword Trendsetter Combo (3)
🏠 Conference Loyalist (21)
🐺 Lone Wolf (10)
👥 Mega-Team (20)
🧬 Topic Evolution
🏆 Keyword Champion (6)
🤝 Dynamic Duo (29)
🔬 Deep Specialist (28)
⚡ Prolific Year (18)
💎 Century Club (167)
🔥 Unstoppable (13)
❓ The Questioner (23)
📈 Trend Setter
🗃️ Keyword Collector (567)
Conferences
EMNLP (48)
ACL (43)
EACL (22)
COLING (21)
NAACL (17)
SEMEVAL (8)
IJCNLP (6)
CONLL (3)
AAAI (1)
ICLR (1)
ICML (1)
IJCAI (1)
INTERSPEECH (1)
Research topics
Keywords
large language model (24)
transfer learning (15)
text classification (13)
multi-task learning (12)
natural language processing (12)
cross-lingual transfer (10)
dependency parsing (10)
named entity recognition (10)
part-of-speech tagging (10)
multilingual nlp (9)
annotation disagreement (8)
zero-shot learning (8)
low-resource language (7)
language model (7)
human label variation (7)
natural language inference (6)
model evaluation (6)
uncertainty quantification (6)
domain adaptation (6)
neural network (6)
Papers
Survey Response Generation: Generating Closed-Ended Survey Responses In-Silico with Large Language Models
ACL 2026
If Probable, Then Acceptable? Understanding Conditional Acceptability Judgments in Large Language Models
EACL 2026
Too Open for Opinion? Embracing Open-Endedness in Large Language Models for Social Simulation
EACL 2026
Controlling Reading Ease with Gaze-Guided Text Generation
EACL 2026
When Meanings Meet: Investigating the Emergence and Quality of Shared Concept Spaces during Multilingual Language Model Training
EACL 2026
Standard-to-Dialect Transfer Trends Differ across Text and Speech: A Case Study on Intent and Topic Classification in German Dialects
ACL 2026
Reason to Rote: Rethinking Memorization in Reasoning
EMNLP 2025
Relevant for the Right Reasons? Investigating Lexical Biases in Zero-Shot and Instruction-Tuned Rerankers
EMNLP 2025
BlackboxNLP-2025 MIB Shared Task: Exploring Ensemble Strategies for Circuit Localization Methods
EMNLP 2025
Mind the Uncertainty in Human Disagreement: Evaluating Discrepancies Between Model Predictions and Human Responses in VQA
AAAI 2025
Algorithmic Fidelity of Large Language Models in Generating Synthetic German Public Opinions: A Case Study
ACL 2025
Pragmatics in the Era of Large Language Models: A Survey on Datasets, Evaluation, Opportunities and Challenges
ACL 2025
Circuit Compositions: Exploring Modular Structures in Transformer-Based Language Models
ACL 2025
Probing LLMs for Multilingual Discourse Generalization Through a Unified Label Set
ACL 2025
What’s the Difference? Supporting Users in Identifying the Effects of Prompt and Model Changes Through Token Patterns
ACL 2025
LLMs instead of Human Judges? A Large Scale Empirical Study across 20 NLP Evaluation Tasks
ACL 2025
A Rose by Any Other Name: LLM-Generated Explanations Are Good Proxies for Human Explanations to Collect Label Distributions on NLI
ACL 2025
Do LLMs Give Psychometrically Plausible Responses in Educational Assessments?
ACL 2025
Better Aligned with Survey Respondents or Training Data? Unveiling Political Leanings of LLMs on U.S. Supreme Court Cases
ACL 2025
MAKIEval: A Multilingual Automatic WiKidata-based Framework for Cultural Awareness Evaluation for LLMs
EMNLP 2025
Make Every Letter Count: Building Dialect Variation Dictionaries from Monolingual Corpora
EMNLP 2025
Evaluating Large Language Models for Cross-Lingual Retrieval
EMNLP 2025
What Media Frames Reveal About Stance: A Dataset and Study about Memes in Climate Change Discourse
EMNLP 2025
Tracing Multilingual Factual Knowledge Acquisition in Pretraining
EMNLP 2025
Crossing Domains without Labels: Distant Supervision for Term Extraction
EMNLP 2025
LiTEx: A Linguistic Taxonomy of Explanations for Understanding Within-Label Variation in Natural Language Inference
EMNLP 2025
Threading the Needle: Reweaving Chain-of-Thought Reasoning to Explain Human Label Variation
EMNLP 2025
The Validation Gap: A Mechanistic Analysis of How Language Models Compute Arithmetic but Fail to Validate It
EMNLP 2025
Disentangling Subjectivity and Uncertainty for Hate Speech Annotation and Modeling using Gaze
EMNLP 2025
Evaluating Pixel Language Models on Non-Standardized Languages
COLING 2025
Cross-Dialect Information Retrieval: Information Access in Low-Resource and High-Variance Languages
COLING 2025
KARRIEREWEGE: A large scale Career Path Prediction Dataset
COLING 2025
Neural Text Normalization for Luxembourgish Using Real-Life Variation Data
COLING 2025
Improving Dialectal Slot and Intent Detection with Auxiliary Tasks: A Multi-Dialectal Bavarian Case Study
COLING 2025
Add Noise, Tasks, or Layers? MaiNLP at the VarDial 2025 Shared Task on Norwegian Dialectal Slot and Intent Detection
COLING 2025
RAcQUEt: Unveiling the Dangers of Overlooked Referential Ambiguity in Visual LLMs
EMNLP 2025
M-ABSA: A Multilingual Dataset for Aspect-Based Sentiment Analysis
EMNLP 2025
Dialetto, ma Quanto Dialetto? Transcribing and Evaluating Dialects on a Continuum
NAACL 2025
Lost in Inference: Rediscovering the Role of Natural Language Inference for Large Language Models
NAACL 2025
Surgical, Cheap, and Flexible: Mitigating False Refusal in Language Models via Single Vector Ablation
ICLR 2025
LeWiDi-2025 at NLPerspectives: Third Edition of the Learning with Disagreements Shared Task
EMNLP 2025
BoN Appetit Team at LeWiDi-2025: Best-of-N Test-time Scaling Can Not Stomach Annotation Disagreements (Yet)
EMNLP 2025
Aligning NLP Models with Target Population Perspectives using PAIR: Population-Aligned Instance Replication
EMNLP 2025
Revisiting Active Learning under (Human) Label Variation
EMNLP 2025
Deep Learning-based Computational Job Market Analysis: A Survey on Skill Extraction and Classification from Job Postings
EACL 2024
Exploring the Robustness of Task-oriented Dialogue Systems for Colloquial German Varieties
EACL 2024
How to Encode Domain Information in Relation Classification
COLING 2024
What’s wrong with your model? A Quantitative Analysis of Relation Classification
NAACL 2024
MaiNLP at SemEval-2024 Task 1: Analyzing Source Language Selection in Cross-Lingual Textual Relatedness
NAACL 2024
Universal NER: A Gold-Standard Multilingual Named Entity Recognition Benchmark
NAACL 2024
Position: Insights from Survey Methodology can Improve Training Data
ICML 2024
IndirectQA: Understanding Indirect Answers to Implicit Polar Questions in French and Spanish
COLING 2024
MaiBaam: A Multi-Dialectal Bavarian Universal Dependency Treebank
COLING 2024
Sebastian, Basti, Wastl?! Recognizing Named Entities in Bavarian Dialectal Data
COLING 2024
Slot and Intent Detection Resources for Bavarian and Lithuanian: Assessing Translations vs Natural Queries to Digital Assistants
COLING 2024
MultiClimate: Multimodal Stance Detection on Climate Change Videos
EMNLP 2024
To Know or Not To Know? Analyzing Self-Consistency of Large Language Models under Ambiguity
EMNLP 2024
“Seeing the Big through the Small”: Can LLMs Approximate Human Judgment Distributions on NLI from a Few Explanations?
EMNLP 2024
The Potential and Challenges of Evaluating Attitudes, Opinions, and Values in Large Language Models
EMNLP 2024
EEVEE: An Easy Annotation Tool for Natural Language Processing
EACL 2024
Donkii: Characterizing and Detecting Errors in Instruction-Tuning Datasets
EACL 2024
Entity Linking in the Job Market Domain
EACL 2024
Interpreting Predictive Probabilities: Model Confidence or Human Label Variation?
EACL 2024
NNOSE: Nearest Neighbor Occupational Skill Extraction
EACL 2024
Through the Lens of Split Vote: Exploring Disagreement, Difficulty and Calibration in Legal Case Outcome Classification
ACL 2024
VariErr NLI: Separating Annotation Error from Human Label Variation
ACL 2024
Comparing Inferential Strategies of Humans and Large Language Models in Deductive Reasoning
ACL 2024
What Do Dialect Speakers Want? A Survey of Attitudes Towards Language Technology for German Dialects
ACL 2024
“My Answer is C”: First-Token Probabilities Do Not Match Text Answers in Instruction-Tuned Language Models
ACL 2024
CLIMATELI: Evaluating Entity Linking on Climate Change Data
ACL 2024
Liar, Liar, Logical Mire: A Benchmark for Suppositional Reasoning in Large Language Models
EMNLP 2024
Different Tastes of Entities: Investigating Human Label Variation in Named Entity Annotations
EACL 2024
More Labels or Cases? Assessing Label Variation in Natural Language Inference
EACL 2024
MaiNLP at SemEval-2024 Task 1: Analyzing Source Language Selection in Cross-Lingual Textual Relatedness
SEMEVAL 2024
ESCOXLM-R: Multilingual Taxonomy-driven Pre-training for the Job Market Domain
ACL 2023
How to Distill your BERT: An Empirical Study on the Impact of Weight Initialisation and Distillation Objectives
ACL 2023
Boosting Zero-shot Cross-lingual Retrieval by Training on Artificially Code-Switched Data
ACL 2023
Silver Syntax Pre-training for Cross-Domain Relation Extraction
ACL 2023
ActiveAED: A Human in the Loop Improves Annotation Error Detection
ACL 2023
SemEval-2023 Task 11: Learning with Disagreements (LeWiDi)
ACL 2023
Does Manipulating Tokenization Aid Cross-Lingual Transfer? A Study on POS Tagging for Non-Standardized Languages
EACL 2023
Findings of the VarDial Evaluation Campaign 2023
EACL 2023
Establishing Trustworthiness: Rethinking Tasks and Model Evaluation
EMNLP 2023
ACTOR: Active Learning with Annotator-specific Classification Heads to Embrace Human Label Variation
EMNLP 2023
From Dissonance to Insights: Dissecting Disagreements in Rationale Construction for Case Outcome Classification
EMNLP 2023
What Comes Next? Evaluating Uncertainty in Neural Text Generators Against Human Production Variability
EMNLP 2023
Subspace Chronicles: How Linguistic Information Emerges, Shifts and Interacts during Language Model Training
EMNLP 2023
SemEval-2023 Task 11: Learning with Disagreements (LeWiDi)
SEMEVAL 2023
Stop Measuring Calibration When Humans Disagree
EMNLP 2022
Sliced at SemEval-2022 Task 11: Bigger, Better? Massively Multilingual LMs for Multilingual Complex NER on an Academic GPU Budget
NAACL 2022
SkillSpan: Hard and Soft Skill Extraction from English Job Postings
NAACL 2022
Sort by Structure: Language Model Ranking as Dependency Probing
NAACL 2022
Evidence > Intuition: Transferability Estimation for Encoder Selection
EMNLP 2022
Spectral Probing
EMNLP 2022
The “Problem” of Human Label Variation: On Ground Truth in Data, Modeling and Evaluation
EMNLP 2022
On Language Spaces, Scales and Cross-Lingual Transfer of UD Parsers
CONLL 2022
Experimental Standards for Deep Learning in Natural Language Processing Research
EMNLP 2022
CrossRE: A Cross-Domain Dataset for Relation Extraction
EMNLP 2022
On Language Spaces, Scales and Cross-Lingual Transfer of UD Parsers
EMNLP 2022
Probing for Labeled Dependency Trees
ACL 2022
What Do You Mean by Relation Extraction? A Survey on Datasets and Study on Scientific Relation Classification
ACL 2022
Sliced at SemEval-2022 Task 11: Bigger, Better? Massively Multilingual LMs for Multilingual Complex NER on an Academic GPU Budget
SEMEVAL 2022
Genre as Weak Supervision for Cross-lingual Dependency Parsing
EMNLP 2021
Cartography Active Learning
EMNLP 2021
“I’ll be there for you”: The One with Understanding Indirect Answers
EMNLP 2021
Cross-Lingual Cross-Domain Nested Named Entity Evaluation on English Web Texts
IJCNLP 2021
Resources and Evaluations for Danish Entity Resolution
EMNLP 2021
Cross-Lingual Cross-Domain Nested Named Entity Evaluation on English Web Texts
ACL 2021
We Need to Consider Disagreement in Evaluation
ACL 2021
SemEval-2021 Task 12: Learning with Disagreements
ACL 2021
Finding the needle in a haystack: Extraction of Informative COVID-19 Danish Tweets
EMNLP 2021
MultiLexNorm: A Shared Task on Multilingual Lexical Normalization
EMNLP 2021
Massive Choice, Ample Tasks (MaChAmp): A Toolkit for Multi-task Learning in NLP
EACL 2021
On the Effectiveness of Dataset Embeddings in Mono-lingual, Multi-lingual and Zero-shot Conditions
EACL 2021
SemEval-2021 Task 12: Learning with Disagreements
SEMEVAL 2021
From back to the roots into the gated woods: Deep learning for NLP
NAACL 2021
Beyond Black & White: Leveraging Annotator Disagreement via Soft-Label Multi-Task Learning
NAACL 2021
From Masked Language Modeling to Translation: Non-English Auxiliary Tasks Improve Zero-shot Spoken Language Understanding
NAACL 2021
SemEval-2021 Task 12: Learning with Disagreements
IJCNLP 2021
We Need to Consider Disagreement in Evaluation
IJCNLP 2021
Team DiSaster at SemEval-2020 Task 11: Combining BERT and Hand-crafted Features for Identifying Propaganda Techniques in News
SEMEVAL 2020
NLP North at WNUT-2020 Task 2: Pre-training versus Ensembling for Detection of Informative COVID-19 English Tweets
EMNLP 2020
Team DiSaster at SemEval-2020 Task 11: Combining BERT and Hand-crafted Features for Identifying Propaganda Techniques in News
COLING 2020
Buhscitu at SemEval-2020 Task 7: Assessing Humour in Edited News Headlines Using Hand-Crafted Features and Online Knowledge Bases
COLING 2020
Neural Unsupervised Domain Adaptation in NLP—A Survey
COLING 2020
DaN+: Danish Nested Named Entities and Lexical Normalization
COLING 2020
FT Speech: Danish Parliament Speech Corpus
INTERSPEECH 2020
Buhscitu at SemEval-2020 Task 7: Assessing Humour in Edited News Headlines Using Hand-Crafted Features and Online Knowledge Bases
SEMEVAL 2020
Biomedical Event Extraction as Sequence Labeling
EMNLP 2020
Beyond task success: A closer look at jointly learning to see, ask, and GuessWhat
NAACL 2019
At a Glance: The Impact of Gaze Aggregation Views on Syntactic Tagging
EMNLP 2019
MoRTy: Unsupervised Learning of Task-specialized Word Embeddings by Autoencoding
ACL 2019
Psycholinguistics Meets Continual Learning: Measuring Catastrophic Forgetting in Visual Question Answering
ACL 2019
Predicting Authorship and Author Traits from Keystroke Dynamics
NAACL 2018
Bleaching Text: Abstract Features for Cross-lingual Gender Prediction
ACL 2018
Distant Supervision from Disparate Sources for Low-Resource Part-of-Speech Tagging
EMNLP 2018
When Simple n-gram Models Outperform Syntactic Approaches: Discriminating between Dutch and Flemish
COLING 2018
Character-level Supervision for Low-resource POS Tagging
ACL 2018
Strong Baselines for Neural Semi-Supervised Learning under Domain Shift
ACL 2018
Grotoco@SLAM: Second Language Acquisition Modeling with Simple Features, Learners and Task-wise Models
NAACL 2018
Proceedings of the Second Workshop on Computational Modeling of People’s Opinions, Personality, and Emotions in Social Media
NAACL 2018
Automatic Description Generation from Images: A Survey of Models, Datasets, and Evaluation Measures (Extended Abstract)
IJCAI 2017
Learning to select data for transfer learning with Bayesian Optimization
EMNLP 2017
All-In-1 at IJCNLP-2017 Task 4: Short Text Classification with One Model for All Languages
IJCNLP 2017
Cross-lingual tagger evaluation without test data
EACL 2017
Parsing Universal Dependencies without training
EACL 2017
When is multitask learning effective? Semantic sequence prediction under varying data conditions
EACL 2017
Keystroke dynamics as signal for shallow syntactic parsing
COLING 2016
Semantic Tagging with Deep Residual Networks
COLING 2016
LiMoSINe Pipeline: Multilingual UIMA-based NLP Platform
ACL 2016
Multilingual Part-of-Speech Tagging with Bidirectional Long Short-Term Memory Models and Auxiliary Loss
ACL 2016
Multi-view and multi-task training of RST discourse parsers
COLING 2016
Learning to parse with IAA-weighted loss
NAACL 2015
Do dependency parsing metrics correlate with human judgments?
CONLL 2015
Inverted indexing for cross-lingual NLP
ACL 2015
CPH: Sentiment analysis of Figurative Language on Twitter #easypeasy #not
SEMEVAL 2015
Inverted indexing for cross-lingual NLP
IJCNLP 2015
Semantic Representations for Domain Adaptation: A Case Study on the Tree Kernel-based Method for Relation Extraction
IJCNLP 2015
Semantic Representations for Domain Adaptation: A Case Study on the Tree Kernel-based Method for Relation Extraction
ACL 2015
Mining for unambiguous instances to adapt part-of-speech taggers to new domains
NAACL 2015
Linguistically debatable or just plain wrong?
ACL 2014
Importance weighting and unsupervised domain adaptation of POS taggers: a negative result
EMNLP 2014
Learning part-of-speech taggers with inter-annotator agreement loss
EACL 2014
Adapting taggers to Twitter with not-so-distant supervision
COLING 2014
Selection Bias, Label Bias, and Bias in Ground Truth
COLING 2014
Experiments with crowdsourced re-annotation of a POS tagging data set
ACL 2014
Copenhagen-Malmö: Tree Approximations of Semantic Parsing Problems
SEMEVAL 2014
Opinion Mining on YouTube
ACL 2014
What’s in a p-value in NLP?
CONLL 2014
Embedding Semantic Similarity in Tree Kernels for Domain Adaptation of Relation Extraction
ACL 2013
Effective Measures of Domain Similarity for Parsing
ACL 2011
Reversible Stochastic Attribute-Value Grammars
ACL 2011
Structural Correspondence Learning for Parse Disambiguation
EACL 2009