conftrace_

Barbara Plank

173 papers · 2009–2026 · 13 conferences · across top CS/AI conferences

Achievements

Jump to papers ↓

+17 more ↓

🧭 Keyword Pioneer 🗺️ Taxonomy Completionist (13) 🌈 Renaissance Researcher (6) 🌉 Interdisciplinary Bridge 🐣 Hot Topic Early Bird

🗺️ Taxonomy Completionist (13) 🧭 Keyword Pioneer 🐣 Hot Topic Early Bird 🌟 Keyword Trendsetter Combo (3) 🏠 Conference Loyalist (21) 🐺 Lone Wolf (10) 👥 Mega-Team (20) 🧬 Topic Evolution 🏆 Keyword Champion (6) 🤝 Dynamic Duo (29) 🔬 Deep Specialist (28) ⚡ Prolific Year (18) 💎 Century Club (167) 🔥 Unstoppable (13) ❓ The Questioner (23) 📈 Trend Setter 🗃️ Keyword Collector (567)

Conferences

EMNLP (48) ACL (43) EACL (22) COLING (21) NAACL (17) SEMEVAL (8) IJCNLP (6) CONLL (3) AAAI (1) ICLR (1) ICML (1) IJCAI (1) INTERSPEECH (1)

Top co-authors

Rob van der Goot (29) Anders Søgaard (19) Siyao Peng (13) Robert Litschko (12) Dirk Hovy (12) Max Müller-Eberstein (11) Héctor Martínez Alonso (11) Verena Blaschke (11) Massimo Poesio (9) Alexandra Uma (8)

Research topics

Applications (2) Education (2) Linguistics (1) Learning Paradigms (1)

Keywords

large language model (24) transfer learning (15) text classification (13) multi-task learning (12) natural language processing (12) cross-lingual transfer (10) dependency parsing (10) named entity recognition (10) part-of-speech tagging (10) multilingual nlp (9) annotation disagreement (8) zero-shot learning (8) low-resource language (7) language model (7) human label variation (7) natural language inference (6) model evaluation (6) uncertainty quantification (6) domain adaptation (6) neural network (6)

Papers

Survey Response Generation: Generating Closed-Ended Survey Responses In-Silico with Large Language Models ACL 2026 If Probable, Then Acceptable? Understanding Conditional Acceptability Judgments in Large Language Models EACL 2026 Too Open for Opinion? Embracing Open-Endedness in Large Language Models for Social Simulation EACL 2026 Controlling Reading Ease with Gaze-Guided Text Generation EACL 2026 When Meanings Meet: Investigating the Emergence and Quality of Shared Concept Spaces during Multilingual Language Model Training EACL 2026 Standard-to-Dialect Transfer Trends Differ across Text and Speech: A Case Study on Intent and Topic Classification in German Dialects ACL 2026 Reason to Rote: Rethinking Memorization in Reasoning EMNLP 2025 Relevant for the Right Reasons? Investigating Lexical Biases in Zero-Shot and Instruction-Tuned Rerankers EMNLP 2025 BlackboxNLP-2025 MIB Shared Task: Exploring Ensemble Strategies for Circuit Localization Methods EMNLP 2025 Mind the Uncertainty in Human Disagreement: Evaluating Discrepancies Between Model Predictions and Human Responses in VQA AAAI 2025 Algorithmic Fidelity of Large Language Models in Generating Synthetic German Public Opinions: A Case Study ACL 2025 Pragmatics in the Era of Large Language Models: A Survey on Datasets, Evaluation, Opportunities and Challenges ACL 2025 Circuit Compositions: Exploring Modular Structures in Transformer-Based Language Models ACL 2025 Probing LLMs for Multilingual Discourse Generalization Through a Unified Label Set ACL 2025 What’s the Difference? Supporting Users in Identifying the Effects of Prompt and Model Changes Through Token Patterns ACL 2025 LLMs instead of Human Judges? A Large Scale Empirical Study across 20 NLP Evaluation Tasks ACL 2025 A Rose by Any Other Name: LLM-Generated Explanations Are Good Proxies for Human Explanations to Collect Label Distributions on NLI ACL 2025 Do LLMs Give Psychometrically Plausible Responses in Educational Assessments? ACL 2025 Better Aligned with Survey Respondents or Training Data? Unveiling Political Leanings of LLMs on U.S. Supreme Court Cases ACL 2025 MAKIEval: A Multilingual Automatic WiKidata-based Framework for Cultural Awareness Evaluation for LLMs EMNLP 2025 Make Every Letter Count: Building Dialect Variation Dictionaries from Monolingual Corpora EMNLP 2025 Evaluating Large Language Models for Cross-Lingual Retrieval EMNLP 2025 What Media Frames Reveal About Stance: A Dataset and Study about Memes in Climate Change Discourse EMNLP 2025 Tracing Multilingual Factual Knowledge Acquisition in Pretraining EMNLP 2025 Crossing Domains without Labels: Distant Supervision for Term Extraction EMNLP 2025 LiTEx: A Linguistic Taxonomy of Explanations for Understanding Within-Label Variation in Natural Language Inference EMNLP 2025 Threading the Needle: Reweaving Chain-of-Thought Reasoning to Explain Human Label Variation EMNLP 2025 The Validation Gap: A Mechanistic Analysis of How Language Models Compute Arithmetic but Fail to Validate It EMNLP 2025 Disentangling Subjectivity and Uncertainty for Hate Speech Annotation and Modeling using Gaze EMNLP 2025 Evaluating Pixel Language Models on Non-Standardized Languages COLING 2025 Cross-Dialect Information Retrieval: Information Access in Low-Resource and High-Variance Languages COLING 2025 KARRIEREWEGE: A large scale Career Path Prediction Dataset COLING 2025 Neural Text Normalization for Luxembourgish Using Real-Life Variation Data COLING 2025 Improving Dialectal Slot and Intent Detection with Auxiliary Tasks: A Multi-Dialectal Bavarian Case Study COLING 2025 Add Noise, Tasks, or Layers? MaiNLP at the VarDial 2025 Shared Task on Norwegian Dialectal Slot and Intent Detection COLING 2025 RAcQUEt: Unveiling the Dangers of Overlooked Referential Ambiguity in Visual LLMs EMNLP 2025 M-ABSA: A Multilingual Dataset for Aspect-Based Sentiment Analysis EMNLP 2025 Dialetto, ma Quanto Dialetto? Transcribing and Evaluating Dialects on a Continuum NAACL 2025 Lost in Inference: Rediscovering the Role of Natural Language Inference for Large Language Models NAACL 2025 Surgical, Cheap, and Flexible: Mitigating False Refusal in Language Models via Single Vector Ablation ICLR 2025 LeWiDi-2025 at NLPerspectives: Third Edition of the Learning with Disagreements Shared Task EMNLP 2025 BoN Appetit Team at LeWiDi-2025: Best-of-N Test-time Scaling Can Not Stomach Annotation Disagreements (Yet) EMNLP 2025 Aligning NLP Models with Target Population Perspectives using PAIR: Population-Aligned Instance Replication EMNLP 2025 Revisiting Active Learning under (Human) Label Variation EMNLP 2025 Deep Learning-based Computational Job Market Analysis: A Survey on Skill Extraction and Classification from Job Postings EACL 2024 Exploring the Robustness of Task-oriented Dialogue Systems for Colloquial German Varieties EACL 2024 How to Encode Domain Information in Relation Classification COLING 2024 What’s wrong with your model? A Quantitative Analysis of Relation Classification NAACL 2024 MaiNLP at SemEval-2024 Task 1: Analyzing Source Language Selection in Cross-Lingual Textual Relatedness NAACL 2024 Universal NER: A Gold-Standard Multilingual Named Entity Recognition Benchmark NAACL 2024 Position: Insights from Survey Methodology can Improve Training Data ICML 2024 IndirectQA: Understanding Indirect Answers to Implicit Polar Questions in French and Spanish COLING 2024 MaiBaam: A Multi-Dialectal Bavarian Universal Dependency Treebank COLING 2024 Sebastian, Basti, Wastl?! Recognizing Named Entities in Bavarian Dialectal Data COLING 2024 Slot and Intent Detection Resources for Bavarian and Lithuanian: Assessing Translations vs Natural Queries to Digital Assistants COLING 2024 MultiClimate: Multimodal Stance Detection on Climate Change Videos EMNLP 2024 To Know or Not To Know? Analyzing Self-Consistency of Large Language Models under Ambiguity EMNLP 2024 “Seeing the Big through the Small”: Can LLMs Approximate Human Judgment Distributions on NLI from a Few Explanations? EMNLP 2024 The Potential and Challenges of Evaluating Attitudes, Opinions, and Values in Large Language Models EMNLP 2024 EEVEE: An Easy Annotation Tool for Natural Language Processing EACL 2024 Donkii: Characterizing and Detecting Errors in Instruction-Tuning Datasets EACL 2024 Entity Linking in the Job Market Domain EACL 2024 Interpreting Predictive Probabilities: Model Confidence or Human Label Variation? EACL 2024 NNOSE: Nearest Neighbor Occupational Skill Extraction EACL 2024 Through the Lens of Split Vote: Exploring Disagreement, Difficulty and Calibration in Legal Case Outcome Classification ACL 2024 VariErr NLI: Separating Annotation Error from Human Label Variation ACL 2024 Comparing Inferential Strategies of Humans and Large Language Models in Deductive Reasoning ACL 2024 What Do Dialect Speakers Want? A Survey of Attitudes Towards Language Technology for German Dialects ACL 2024 “My Answer is C”: First-Token Probabilities Do Not Match Text Answers in Instruction-Tuned Language Models ACL 2024 CLIMATELI: Evaluating Entity Linking on Climate Change Data ACL 2024 Liar, Liar, Logical Mire: A Benchmark for Suppositional Reasoning in Large Language Models EMNLP 2024 Different Tastes of Entities: Investigating Human Label Variation in Named Entity Annotations EACL 2024 More Labels or Cases? Assessing Label Variation in Natural Language Inference EACL 2024 MaiNLP at SemEval-2024 Task 1: Analyzing Source Language Selection in Cross-Lingual Textual Relatedness SEMEVAL 2024 ESCOXLM-R: Multilingual Taxonomy-driven Pre-training for the Job Market Domain ACL 2023 How to Distill your BERT: An Empirical Study on the Impact of Weight Initialisation and Distillation Objectives ACL 2023 Boosting Zero-shot Cross-lingual Retrieval by Training on Artificially Code-Switched Data ACL 2023 Silver Syntax Pre-training for Cross-Domain Relation Extraction ACL 2023 ActiveAED: A Human in the Loop Improves Annotation Error Detection ACL 2023 SemEval-2023 Task 11: Learning with Disagreements (LeWiDi) ACL 2023 Does Manipulating Tokenization Aid Cross-Lingual Transfer? A Study on POS Tagging for Non-Standardized Languages EACL 2023 Findings of the VarDial Evaluation Campaign 2023 EACL 2023 Establishing Trustworthiness: Rethinking Tasks and Model Evaluation EMNLP 2023 ACTOR: Active Learning with Annotator-specific Classification Heads to Embrace Human Label Variation EMNLP 2023 From Dissonance to Insights: Dissecting Disagreements in Rationale Construction for Case Outcome Classification EMNLP 2023 What Comes Next? Evaluating Uncertainty in Neural Text Generators Against Human Production Variability EMNLP 2023 Subspace Chronicles: How Linguistic Information Emerges, Shifts and Interacts during Language Model Training EMNLP 2023 SemEval-2023 Task 11: Learning with Disagreements (LeWiDi) SEMEVAL 2023 Stop Measuring Calibration When Humans Disagree EMNLP 2022 Sliced at SemEval-2022 Task 11: Bigger, Better? Massively Multilingual LMs for Multilingual Complex NER on an Academic GPU Budget NAACL 2022 SkillSpan: Hard and Soft Skill Extraction from English Job Postings NAACL 2022 Sort by Structure: Language Model Ranking as Dependency Probing NAACL 2022 Evidence > Intuition: Transferability Estimation for Encoder Selection EMNLP 2022 Spectral Probing EMNLP 2022 The “Problem” of Human Label Variation: On Ground Truth in Data, Modeling and Evaluation EMNLP 2022 On Language Spaces, Scales and Cross-Lingual Transfer of UD Parsers CONLL 2022 Experimental Standards for Deep Learning in Natural Language Processing Research EMNLP 2022 CrossRE: A Cross-Domain Dataset for Relation Extraction EMNLP 2022 On Language Spaces, Scales and Cross-Lingual Transfer of UD Parsers EMNLP 2022 Probing for Labeled Dependency Trees ACL 2022 What Do You Mean by Relation Extraction? A Survey on Datasets and Study on Scientific Relation Classification ACL 2022 Sliced at SemEval-2022 Task 11: Bigger, Better? Massively Multilingual LMs for Multilingual Complex NER on an Academic GPU Budget SEMEVAL 2022 Genre as Weak Supervision for Cross-lingual Dependency Parsing EMNLP 2021 Cartography Active Learning EMNLP 2021 “I’ll be there for you”: The One with Understanding Indirect Answers EMNLP 2021 Cross-Lingual Cross-Domain Nested Named Entity Evaluation on English Web Texts IJCNLP 2021 Resources and Evaluations for Danish Entity Resolution EMNLP 2021 Cross-Lingual Cross-Domain Nested Named Entity Evaluation on English Web Texts ACL 2021 We Need to Consider Disagreement in Evaluation ACL 2021 SemEval-2021 Task 12: Learning with Disagreements ACL 2021 Finding the needle in a haystack: Extraction of Informative COVID-19 Danish Tweets EMNLP 2021 MultiLexNorm: A Shared Task on Multilingual Lexical Normalization EMNLP 2021 Massive Choice, Ample Tasks (MaChAmp): A Toolkit for Multi-task Learning in NLP EACL 2021 On the Effectiveness of Dataset Embeddings in Mono-lingual,Multi-lingual and Zero-shot Conditions EACL 2021 SemEval-2021 Task 12: Learning with Disagreements SEMEVAL 2021 From back to the roots into the gated woods: Deep learning for NLP NAACL 2021 Beyond Black & White: Leveraging Annotator Disagreement via Soft-Label Multi-Task Learning NAACL 2021 From Masked Language Modeling to Translation: Non-English Auxiliary Tasks Improve Zero-shot Spoken Language Understanding NAACL 2021 SemEval-2021 Task 12: Learning with Disagreements IJCNLP 2021 We Need to Consider Disagreement in Evaluation IJCNLP 2021 Team DiSaster at SemEval-2020 Task 11: Combining BERT and Hand-crafted Features for Identifying Propaganda Techniques in News SEMEVAL 2020 NLP North at WNUT-2020 Task 2: Pre-training versus Ensembling for Detection of Informative COVID-19 English Tweets EMNLP 2020 Team DiSaster at SemEval-2020 Task 11: Combining BERT and Hand-crafted Features for Identifying Propaganda Techniques in News COLING 2020 Buhscitu at SemEval-2020 Task 7: Assessing Humour in Edited News Headlines Using Hand-Crafted Features and Online Knowledge Bases COLING 2020 Neural Unsupervised Domain Adaptation in NLP—A Survey COLING 2020 DaN+: Danish Nested Named Entities and Lexical Normalization COLING 2020 FT Speech: Danish Parliament Speech Corpus INTERSPEECH 2020 Buhscitu at SemEval-2020 Task 7: Assessing Humour in Edited News Headlines Using Hand-Crafted Features and Online Knowledge Bases SEMEVAL 2020 Biomedical Event Extraction as Sequence Labeling EMNLP 2020 Beyond task success: A closer look at jointly learning to see, ask, and GuessWhat NAACL 2019 At a Glance: The Impact of Gaze Aggregation Views on Syntactic Tagging EMNLP 2019 MoRTy: Unsupervised Learning of Task-specialized Word Embeddings by Autoencoding ACL 2019 Psycholinguistics Meets Continual Learning: Measuring Catastrophic Forgetting in Visual Question Answering ACL 2019 Predicting Authorship and Author Traits from Keystroke Dynamics NAACL 2018 Bleaching Text: Abstract Features for Cross-lingual Gender Prediction ACL 2018 Distant Supervision from Disparate Sources for Low-Resource Part-of-Speech Tagging EMNLP 2018 When Simple n-gram Models Outperform Syntactic Approaches: Discriminating between Dutch and Flemish COLING 2018 Character-level Supervision for Low-resource POS Tagging ACL 2018 Strong Baselines for Neural Semi-Supervised Learning under Domain Shift ACL 2018 Grotoco@SLAM: Second Language Acquisition Modeling with Simple Features, Learners and Task-wise Models NAACL 2018 Proceedings of the Second Workshop on Computational Modeling of People’s Opinions, Personality, and Emotions in Social Media NAACL 2018 Automatic Description Generation from Images: A Survey of Models, Datasets, and Evaluation Measures (Extended Abstract) IJCAI 2017 Learning to select data for transfer learning with Bayesian Optimization EMNLP 2017 All-In-1 at IJCNLP-2017 Task 4: Short Text Classification with One Model for All Languages IJCNLP 2017 Cross-lingual tagger evaluation without test data EACL 2017 Parsing Universal Dependencies without training EACL 2017 When is multitask learning effective? Semantic sequence prediction under varying data conditions EACL 2017 Keystroke dynamics as signal for shallow syntactic parsing COLING 2016 Semantic Tagging with Deep Residual Networks COLING 2016 LiMoSINe Pipeline: Multilingual UIMA-based NLP Platform ACL 2016 Multilingual Part-of-Speech Tagging with Bidirectional Long Short-Term Memory Models and Auxiliary Loss ACL 2016 Multi-view and multi-task training of RST discourse parsers COLING 2016 Learning to parse with IAA-weighted loss NAACL 2015 Do dependency parsing metrics correlate with human judgments? CONLL 2015 Inverted indexing for cross-lingual NLP ACL 2015 CPH: Sentiment analysis of Figurative Language on Twitter #easypeasy #not SEMEVAL 2015 Inverted indexing for cross-lingual NLP IJCNLP 2015 Semantic Representations for Domain Adaptation: A Case Study on the Tree Kernel-based Method for Relation Extraction IJCNLP 2015 Semantic Representations for Domain Adaptation: A Case Study on the Tree Kernel-based Method for Relation Extraction ACL 2015 Mining for unambiguous instances to adapt part-of-speech taggers to new domains NAACL 2015 Linguistically debatable or just plain wrong? ACL 2014 Importance weighting and unsupervised domain adaptation of POS taggers: a negative result EMNLP 2014 Learning part-of-speech taggers with inter-annotator agreement loss EACL 2014 Adapting taggers to Twitter with not-so-distant supervision COLING 2014 Selection Bias, Label Bias, and Bias in Ground Truth COLING 2014 Experiments with crowdsourced re-annotation of a POS tagging data set ACL 2014 Copenhagen-Malmö: Tree Approximations of Semantic Parsing Problems SEMEVAL 2014 Opinion Mining on YouTube ACL 2014 What’s in a p-value in NLP? CONLL 2014 Embedding Semantic Similarity in Tree Kernels for Domain Adaptation of Relation Extraction ACL 2013 Effective Measures of Domain Similarity for Parsing ACL 2011 Reversible Stochastic Attribute-Value Grammars ACL 2011 Structural Correspondence Learning for Parse Disambiguation EACL 2009