Alex Warstadt

33 papers · 2019–2026 · 7 conferences · across top CS/AI conferences

Achievements

+10 more ↓

🏃 Academic Marathon (6) 🌉 Interdisciplinary Bridge 🧭 Keyword Pioneer 🌍 Conference Polyglot (7) 🐝 Cross-Pollinator (7)

🐣 Hot Topic Early Bird 🌍 Conference Polyglot (7) 🏃 Academic Marathon (6) 🧬 Topic Evolution 👥 Mega-Team (26) 🔥 Unstoppable (7) ⚡ Prolific Year (5) 💎 Century Club (29) ❓ The Questioner (6) 🗃️ Keyword Collector (129)

Conferences

ACL (12) EMNLP (9) CONLL (5) IJCNLP (3) EACL (2) COLING (1) NAACL (1)

Top co-authors

Samuel R. Bowman (9) Ryan Cotterell (9) Tal Linzen (8) Ethan Gotlieb Wilcox (7) Ethan Wilcox (6) Leshem Choshen (5) Adina Williams (5) Aaron Mueller (4) Alicia Parrish (4) Lukas Wolf (4)

Keywords

language model (6) natural language inference (5) pretrained language model (5) question answering (4) natural language understanding (4) speech prosody (3) transformer model (3) information theory (3) large language model (3) data quality (2) representation learning (2) benchmark evaluation (2) cognitive modeling (2) commonsense knowledge (2) language acquisition (2) semantic analysis (2) probing analysis (2) language modeling (2) language model evaluation (2) bert model (2)

Papers

Dual Alignment Between Language Model Layers and Human Sentence Processing ACL 2026 It’s Not What You Say, It’s How You Say It: Evaluating LLM Responses to Expressions of Belief ACL 2026 BabyBabelLM: A Multilingual Benchmark of Developmentally Plausible Training Data EACL 2026 What Do Prosody and Text Convey? Characterizing How Meaningful Information is Distributed Across Multiple Channels ACL 2026 A Distributional Perspective on Word Learning in Neural Language Models NAACL 2025 Using Information Theory to Characterize Prosodic Typology: The Case of Tone, Pitch-Accent and Stress-Accent ACL 2025 The time scale of redundancy between prosody and linguistic context ACL 2025 The Harmonic Structure of Information Contours ACL 2025 Findings of the Third BabyLM Challenge: Accelerating Language Modeling Research with Cognitively Plausible Data EMNLP 2025 Automatic Annotation of Grammaticality in Child-Caregiver Conversations COLING 2024 Findings of the Second BabyLM Challenge: Sample-Efficient Pretraining on Developmentally Plausible Corpora CONLL 2024 Surprise! Uniform Information Density Isn’t the Whole Story: Predicting Surprisal Contours in Long-form Discourse EMNLP 2024 Findings of the BabyLM Challenge: Sample-Efficient Pretraining on Developmentally Plausible Corpora CONLL 2023 WhisBERT: Multimodal Text-Audio Language Modeling on 100M Words CONLL 2023 WhisBERT: Multimodal Text-Audio Language Modeling on 100M Words EMNLP 2023 Findings of the BabyLM Challenge: Sample-Efficient Pretraining on Developmentally Plausible Corpora EMNLP 2023 Generalizing Backpropagation for Gradient-Based Interpretability ACL 2023 Quantifying the redundancy between prosody and text EMNLP 2023 Reconstruction Probing ACL 2023 Entailment Semantics Can Be Extracted from an Ideal Language Model EMNLP 2022 Entailment Semantics Can Be Extracted from an Ideal Language Model CONLL 2022 What Makes Reading Comprehension Questions Difficult? ACL 2022 When Do You Need Billions of Words of Pretraining Data? ACL 2021 CLiMP: A Benchmark for Chinese Language Model Evaluation EACL 2021 What Ingredients Make for an Effective Crowdsourcing Protocol for Difficult NLU Data Collection Tasks? ACL 2021 NOPE: A Corpus of Naturally-Occurring Presuppositions in English CONLL 2021 What Ingredients Make for an Effective Crowdsourcing Protocol for Difficult NLU Data Collection Tasks? IJCNLP 2021 When Do You Need Billions of Words of Pretraining Data? IJCNLP 2021 NOPE: A Corpus of Naturally-Occurring Presuppositions in English EMNLP 2021 Are Natural Language Inference Models IMPPRESsive? Learning IMPlicature and PRESupposition ACL 2020 Learning Which Features Matter: RoBERTa Acquires a Preference for Linguistic Generalizations (Eventually) EMNLP 2020 Investigating BERT’s Knowledge of Language: Five Analysis Methods with NPIs IJCNLP 2019 Investigating BERT’s Knowledge of Language: Five Analysis Methods with NPIs EMNLP 2019