Patrick Haller
15 papers · 2021–2026 · 7 conferences · across top CS/AI conferences
Achievements
Jump to papers ↓+9 more ↓ Show less ↑
π Cross-Pollinator (13) π§ Keyword Pioneer π Interdisciplinary Bridge π Conference Polyglot (6) π Academic Marathon (5)
π
Academic Marathon
(5)
π
Renaissance Researcher
(8)
πΊοΈ
Taxonomy Completionist
(38)
π₯
Mega-Team
(43)
π§¬
Topic Evolution
β‘
Prolific Year
(5)
π
Century Club
(14)
ποΈ
Keyword Collector
(67)
π₯
Unstoppable
(5)
Conferences
EMNLP (6)
ACL (3)
NAACL (2)
COLING (1)
CONLL (1)
EACL (1)
NIPS (1)
Top co-authors
Research topics
Keywords
language model
(4)
large language model
(4)
sample efficiency
(3)
named entity recognition
(3)
zero-shot learning
(3)
instruction tuning
(3)
knowledge distillation
(2)
reading time
(2)
eye movement
(2)
eye tracking
(1)
model architecture
(1)
code generation
(1)
data augmentation
(1)
factual knowledge
(1)
language modeling
(1)
dataset creation
(1)
uniform information density
(1)
label shift
(1)
semantic similarity
(1)
language production
(1)
Papers
FiNERweb: Datasets and Artifacts for Scalable Multilingual Named Entity Recognition
EACL 2026
Familiarity: Better Evaluation of Zero-Shot Named Entity Recognition by Quantifying Label Shifts in Synthetic Training Data
NAACL 2025
Leveraging In-Context Learning for Political Bias Testing of LLMs
ACL 2025
From Data to Knowledge: Evaluating How Efficiently Language Models Learn Facts
ACL 2025
Sample-Efficient Language Modeling with Linear Attention and Lightweight Enhancements
EMNLP 2025
On the alignment of LM language generation and human language comprehension
EMNLP 2024
OpinionGPT: Modelling Explicit Biases in Instruction-Tuned LLMs
NAACL 2024
Language models emulate certain cognitive profiles: An investigation of how predictability measures interact with individual differences
ACL 2024
BabyHGRN: Exploring RNNs for Sample-Efficient Language Modeling
CONLL 2024
PECC: Problem Extraction and Coding Challenges
COLING 2024
ScanDL: A Diffusion Model for Generating Synthetic Scanpaths on Texts
EMNLP 2023
Fabricator: An Open Source Toolkit for Generating Labeled Training Data with Teacher LLMs
EMNLP 2023
Eye-tracking based classification of Mandarin Chinese readers with and without dyslexia using neural sequence models
EMNLP 2022
BigBio: A Framework for Data-Centric Biomedical Natural Language Processing
NIPS 2022
Revisiting the Uniform Information Density Hypothesis
EMNLP 2021