Patrick Haller

15 papers · 2021–2026 · 7 conferences · across top CS/AI conferences

Achievements

+9 more ↓

🐝 Cross-Pollinator (13) 🧭 Keyword Pioneer 🌉 Interdisciplinary Bridge 🌍 Conference Polyglot (6) 🏃 Academic Marathon (5)

🏃 Academic Marathon (5) 🌈 Renaissance Researcher (8) 🗺️ Taxonomy Completionist (38) 👥 Mega-Team (43) 🧬 Topic Evolution ⚡ Prolific Year (5) 💎 Century Club (14) 🗃️ Keyword Collector (67) 🔥 Unstoppable (5)

Conferences

EMNLP (6) ACL (3) NAACL (2) COLING (1) CONLL (1) EACL (1) NIPS (1)

Top co-authors

Alan Akbik (8) Jonas Golde (7) Lena Jäger (4) Lena Ann Jäger (2) Max Ploner (2) Fabio Barth (2) Lena Bolliger (2) Alison Callahan (1) Bo Wang (1) Felix Hamborg (1)

Research topics

Linguistics (1)

Keywords

language model (4) large language model (4) sample efficiency (3) named entity recognition (3) zero-shot learning (3) instruction tuning (3) knowledge distillation (2) reading time (2) eye movement (2) eye tracking (1) model architecture (1) code generation (1) data augmentation (1) factual knowledge (1) language modeling (1) dataset creation (1) uniform information density (1) label shift (1) semantic similarity (1) language production (1)

Papers

FiNERweb: Datasets and Artifacts for Scalable Multilingual Named Entity Recognition EACL 2026 Familiarity: Better Evaluation of Zero-Shot Named Entity Recognition by Quantifying Label Shifts in Synthetic Training Data NAACL 2025 Leveraging In-Context Learning for Political Bias Testing of LLMs ACL 2025 From Data to Knowledge: Evaluating How Efficiently Language Models Learn Facts ACL 2025 Sample-Efficient Language Modeling with Linear Attention and Lightweight Enhancements EMNLP 2025 On the alignment of LM language generation and human language comprehension EMNLP 2024 OpinionGPT: Modelling Explicit Biases in Instruction-Tuned LLMs NAACL 2024 Language models emulate certain cognitive profiles: An investigation of how predictability measures interact with individual differences ACL 2024 BabyHGRN: Exploring RNNs for Sample-Efficient Language Modeling CONLL 2024 PECC: Problem Extraction and Coding Challenges COLING 2024 ScanDL: A Diffusion Model for Generating Synthetic Scanpaths on Texts EMNLP 2023 Fabricator: An Open Source Toolkit for Generating Labeled Training Data with Teacher LLMs EMNLP 2023 Eye-tracking based classification of Mandarin Chinese readers with and without dyslexia using neural sequence models EMNLP 2022 BigBio: A Framework for Data-Centric Biomedical Natural Language Processing NIPS 2022 Revisiting the Uniform Information Density Hypothesis EMNLP 2021