Alex Warstadt
33 papers · 2019–2026 · 7 conferences · across top CS/AI conferences
Achievements
Jump to papers ↓+10 more ↓ Show less ↑
π Academic Marathon (6) π Interdisciplinary Bridge π§ Keyword Pioneer π Conference Polyglot (7) π Cross-Pollinator (7)
π£
Hot Topic Early Bird
π
Conference Polyglot
(7)
π
Academic Marathon
(6)
π§¬
Topic Evolution
π₯
Mega-Team
(26)
π₯
Unstoppable
(7)
β‘
Prolific Year
(5)
π
Century Club
(29)
β
The Questioner
(6)
ποΈ
Keyword Collector
(129)
Conferences
ACL (12)
EMNLP (9)
CONLL (5)
IJCNLP (3)
EACL (2)
COLING (1)
NAACL (1)
Top co-authors
Keywords
language model
(6)
natural language inference
(5)
pretrained language model
(5)
question answering
(4)
natural language understanding
(4)
speech prosody
(3)
transformer model
(3)
information theory
(3)
large language model
(3)
data quality
(2)
representation learning
(2)
benchmark evaluation
(2)
cognitive modeling
(2)
commonsense knowledge
(2)
language acquisition
(2)
semantic analysis
(2)
probing analysis
(2)
language modeling
(2)
language model evaluation
(2)
bert model
(2)
Papers
Dual Alignment Between Language Model Layers and Human Sentence Processing
ACL 2026
Itβs Not What You Say, Itβs How You Say It: Evaluating LLM Responses to Expressions of Belief
ACL 2026
BabyBabelLM: A Multilingual Benchmark of Developmentally Plausible Training Data
EACL 2026
What Do Prosody and Text Convey? Characterizing How Meaningful Information is Distributed Across Multiple Channels
ACL 2026
A Distributional Perspective on Word Learning in Neural Language Models
NAACL 2025
Using Information Theory to Characterize Prosodic Typology: The Case of Tone, Pitch-Accent and Stress-Accent
ACL 2025
The time scale of redundancy between prosody and linguistic context
ACL 2025
The Harmonic Structure of Information Contours
ACL 2025
Findings of the Third BabyLM Challenge: Accelerating Language Modeling Research with Cognitively Plausible Data
EMNLP 2025
Automatic Annotation of Grammaticality in Child-Caregiver Conversations
COLING 2024
Findings of the Second BabyLM Challenge: Sample-Efficient Pretraining on Developmentally Plausible Corpora
CONLL 2024
Surprise! Uniform Information Density Isnβt the Whole Story: Predicting Surprisal Contours in Long-form Discourse
EMNLP 2024
Findings of the BabyLM Challenge: Sample-Efficient Pretraining on Developmentally Plausible Corpora
CONLL 2023
WhisBERT: Multimodal Text-Audio Language Modeling on 100M Words
CONLL 2023
WhisBERT: Multimodal Text-Audio Language Modeling on 100M Words
EMNLP 2023
Findings of the BabyLM Challenge: Sample-Efficient Pretraining on Developmentally Plausible Corpora
EMNLP 2023
Generalizing Backpropagation for Gradient-Based Interpretability
ACL 2023
Quantifying the redundancy between prosody and text
EMNLP 2023
Reconstruction Probing
ACL 2023
Entailment Semantics Can Be Extracted from an Ideal Language Model
EMNLP 2022
Entailment Semantics Can Be Extracted from an Ideal Language Model
CONLL 2022
What Makes Reading Comprehension Questions Difficult?
ACL 2022
When Do You Need Billions of Words of Pretraining Data?
ACL 2021
CLiMP: A Benchmark for Chinese Language Model Evaluation
EACL 2021
What Ingredients Make for an Effective Crowdsourcing Protocol for Difficult NLU Data Collection Tasks?
ACL 2021
NOPE: A Corpus of Naturally-Occurring Presuppositions in English
CONLL 2021
What Ingredients Make for an Effective Crowdsourcing Protocol for Difficult NLU Data Collection Tasks?
IJCNLP 2021
When Do You Need Billions of Words of Pretraining Data?
IJCNLP 2021
NOPE: A Corpus of Naturally-Occurring Presuppositions in English
EMNLP 2021
Are Natural Language Inference Models IMPPRESsive? Learning IMPlicature and PRESupposition
ACL 2020
Learning Which Features Matter: RoBERTa Acquires a Preference for Linguistic Generalizations (Eventually)
EMNLP 2020
Investigating BERTβs Knowledge of Language: Five Analysis Methods with NPIs
IJCNLP 2019
Investigating BERTβs Knowledge of Language: Five Analysis Methods with NPIs
EMNLP 2019