Ryan Cotterell

225 papers · 2014–2026 · 13 conferences · across top CS/AI conferences

Achievements

+19 more ↓

🗺️ Taxonomy Completionist (14) 🧭 Keyword Pioneer 🌉 Interdisciplinary Bridge 🌈 Renaissance Researcher (5) 🐣 Hot Topic Early Bird

🌉 Interdisciplinary Bridge 🐣 Hot Topic Early Bird 🗺️ Taxonomy Completionist (14) 🌟 Keyword Trendsetter Combo (6) 🏠 Conference Loyalist (61) 👑 Domain Dominant (56) 🤝 Dynamic Duo (36) 🏆 Grand Slam 🏆 Keyword Champion (2) 👑 Triple Crown 👥 Mega-Team (58) 🌱 Topic Pioneer 🔬 Deep Specialist (26) ❓ The Questioner (13) 🗃️ Keyword Collector (706) ⚡ Prolific Year (28) 📈 Trend Setter 🔥 Unstoppable (12) 💎 Century Club (219)

Conferences

ACL (70) EMNLP (63) NAACL (36) IJCNLP (15) EACL (11) CONLL (8) ICLR (7) ICML (6) COLING (3) NIPS (3) AAAI (1) AISTATS (1) CVPR (1)

Top co-authors

Tiago Pimentel (36) Tim Vieira (31) Clara Meister (29) Jason Eisner (24) Anej Svete (18) Adina Williams (17) Ekaterina Vylomova (16) Mrinmaya Sachan (15) Afra Amini (13) Niklas Stoehr (13)

Research topics

Linguistics (10) Statistics (3) Applications (2) Understanding (1)

Keywords

language model (34) information theory (19) neural network (16) representation learning (15) morphological inflection (11) recurrent neural network (11) language modeling (11) word embedding (10) dependency parsing (10) low-resource language (9) bias mitigation (8) formal language (8) gender bia (7) cross-linguistic analysis (7) mutual information (7) probability distribution (6) inductive bia (6) multilingual nlp (6) probabilistic modeling (6) latent variable model (6)

Papers

Prefix Parsing is Just Parsing ACL 2026 Probing for Reading Times ACL 2026 On the Proper Treatment of Units in Surprisal Theory ACL 2026 Characterizing the Expressivity of Local Attention in Transformers ACL 2026 Syntactic and Semantic Control of Large Language Models via Sequential Monte Carlo ICLR 2025 Taxonomy-Aware Evaluation of Vision-Language Models CVPR 2025 Controllable Context Sensitivity and the Knob Behind It ICLR 2025 Training Neural Networks as Recognizers of Formal Languages ICLR 2025 The Foundations of Tokenization: Statistical and Computational Concerns ICLR 2025 Gumbel Counterfactual Generation From Language Models ICLR 2025 Variational Best-of-N Alignment ICLR 2025 Syntactic Control of Language Models by Posterior Inference ACL 2025 Unique Hard Attention: A Tale of Two Sides ACL 2025 The Harmonic Structure of Information Contours ACL 2025 Pointwise Mutual Information as a Performance Gauge for Retrieval-Augmented Generation NAACL 2025 How Persuasive Is Your Context? EMNLP 2025 A Spatio-Temporal Point Process for Fine-Grained Modeling of Reading Behavior ACL 2025 Information Locality as an Inductive Bias for Neural Language Models ACL 2025 A Distributional Perspective on Word Learning in Neural Language Models NAACL 2025 A Practical Method for Generating String Counterfactuals NAACL 2025 From Language Models over Tokens to Language Models over Characters ICML 2025 Language Models over Canonical Byte-Pair Encodings ICML 2025 Findings of the Third BabyLM Challenge: Accelerating Language Modeling Research with Cognitively Plausible Data EMNLP 2025 The Role of n-gram Smoothing in the Age of Neural Networks NAACL 2024 Transformers Can Represent n-gram Language Models NAACL 2024 Lower Bounds on the Expressivity of Recurrent Neural Language Models NAACL 2024 Representation Surgery: Theory and Practice of Affine Steering ICML 2024 Do Language Models Exhibit the Same Cognitive Biases in Problem Solving as Human Learners? ICML 2024 On Affine Homotopy between Language Encoders NIPS 2024 What Do Language Models Learn in Context? The Structured Task Hypothesis. ACL 2024 On the Representational Capacity of Neural Language Models with Chain-of-Thought Reasoning ACL 2024 Context versus Prior Knowledge in Language Models ACL 2024 What Languages are Easy to Language-Model? A Perspective from Learning Probabilistic Regular Languages ACL 2024 Computational Expressivity of Neural Language Models ACL 2024 On Efficiently Representing Regular Languages as RNNs ACL 2024 Direct Preference Optimization with an Offset ACL 2024 When is a Language Process a Language Model? ACL 2024 Principled Gradient-Based MCMC for Conditional Sampling of Text ICML 2024 Findings of the Second BabyLM Challenge: Sample-Efficient Pretraining on Developmentally Plausible Corpora CONLL 2024 Generalized Measures of Anticipation and Responsivity in Online Language Processing EMNLP 2024 Activation Scaling for Steering and Interpreting Language Models EMNLP 2024 A Transformer with Stack Attention NAACL 2024 Towards Explainability in Legal Outcome Prediction Models NAACL 2024 Efficiently Computing Susceptibility to Context in Language Models EMNLP 2024 Surprise! Uniform Information Density Isn’t the Whole Story: Predicting Surprisal Contours in Long-form Discourse EMNLP 2024 On the Proper Treatment of Tokenization in Psycholinguistics EMNLP 2024 A Probability–Quality Trade-off in Aligned Language Models and its Relation to Sampling Adaptors EMNLP 2024 Can Transformers Learn n-gram Language Models? EMNLP 2024 Reverse-Engineering the Reader EMNLP 2024 An L* Algorithm for Deterministic Weighted Regular Languages EMNLP 2024 On the Role of Context in Reading Time Prediction EMNLP 2024 Language Model Quality Correlates with Psychometric Predictive Power in Multiple Languages EMNLP 2023 Generalizing Backpropagation for Gradient-Based Interpretability ACL 2023 A Fast Algorithm for Computing Prefix Probabilities ACL 2023 Hexatagging: Projective Dependency Parsing as Tagging ACL 2023 Generating Text from Language Models ACL 2023 A Formal Perspective on Byte-Pair Encoding ACL 2023 Revisiting the Optimality of Word Lengths EMNLP 2023 On the Usefulness of Embeddings, Clusters and Strings for Text Generation Evaluation ICLR 2023 Efficient Algorithms for Recognizing Weighted Tree-Adjoining Languages EMNLP 2023 On the Representational Capacity of Recurrent Neural Language Models EMNLP 2023 On the Intersection of Context-Free and Regular Languages EACL 2023 An Exploration of Left-Corner Transformations EMNLP 2023 A Measure-Theoretic Characterization of Tight Language Models ACL 2023 Quantifying the redundancy between prosody and text EMNLP 2023 Recurrent Neural Language Models as Probabilistic Finite-state Automata EMNLP 2023 The Ordered Matrix Dirichlet for State-Space Models AISTATS 2023 Findings of the BabyLM Challenge: Sample-Efficient Pretraining on Developmentally Plausible Corpora EMNLP 2023 Controlled Text Generation with Natural Language Instructions ICML 2023 Structured Voronoi Sampling NIPS 2023 Findings of the BabyLM Challenge: Sample-Efficient Pretraining on Developmentally Plausible Corpora CONLL 2023 Sentiment as an Ordinal Latent Variable EACL 2023 LEACE: Perfect linear concept erasure in closed form NIPS 2023 Linear-Time Modeling of Linguistic Structure: An Order-Theoretic Perspective EMNLP 2023 A Latent-Variable Model for Intrinsic Probing AAAI 2023 On the Efficacy of Sampling Adapters ACL 2023 Efficient Semiring-Weighted Earley Parsing ACL 2023 An Ordinal Latent Variable Model of Conflict Intensity ACL 2023 Tokenization and the Noiseless Channel ACL 2023 Convergence and Diversity in the Control Hierarchy ACL 2023 Discourse-Centric Evaluation of Document-level Machine Translation with a New Densely Annotated Parallel Corpus of Novels ACL 2023 Log-linear Guardedness and its Implications ACL 2023 Probing as Quantifying Inductive Bias ACL 2022 Probing for the Usage of Grammatical Number ACL 2022 Analyzing Wrap-Up Effects through an Information-Theoretic Lens ACL 2022 On the probability–quality paradox in language generation ACL 2022 Estimating the Entropy of Linguistic Distributions ACL 2022 Probing via Prompting NAACL 2022 SIGMORPHON–UniMorph 2022 Shared Task 0: Generalization and Typologically Diverse Morphological Inflection NAACL 2022 The SIGTYP 2022 Shared Task on the Prediction of Cognate Reflexes NAACL 2022 BlonDe: An Automatic Evaluation Metric for Document-level Machine Translation NAACL 2022 Mutual Information Alleviates Hallucinations in Abstractive Summarization EMNLP 2022 Adversarial Concept Erasure in Kernel Space EMNLP 2022 Same Neurons, Different Languages: Probing Morphosyntax in Multilingual Pre-trained Models NAACL 2022 A Structured Span Selector NAACL 2022 Equivariant Transduction through Invariant Alignment COLING 2022 Benchmarking Compositionality with Formal Languages COLING 2022 Exact Paired-Permutation Testing for Structured Test Statistics NAACL 2022 Algorithms for Acyclic Weighted Finite-State Automata with Failure Arcs EMNLP 2022 On Parsing as Tagging EMNLP 2022 Algorithms for Weighted Pushdown Automata EMNLP 2022 The Architectural Bottleneck Principle EMNLP 2022 Autoregressive Structured Prediction with Language Models EMNLP 2022 The SIGMORPHON 2022 Shared Task on Morpheme Segmentation NAACL 2022 On the Machine Learning of Ethical Judgments from Natural Language NAACL 2022 Do Syntactic Probes Probe Syntax? Experiments with Jabberwocky Probing NAACL 2021 Examining the Inductive Bias of Neural Language Models with Artificial Languages ACL 2021 On Finding the K-best Non-projective Dependency Trees ACL 2021 A Cognitive Regularizer for Language Modeling ACL 2021 Language Model Evaluation Beyond Perplexity ACL 2021 Determinantal Beam Search ACL 2021 Is Sparse Attention more Interpretable? ACL 2021 Higher-order Derivatives of Weighted Finite-state Machines ACL 2021 Modeling the Unigram Distribution ACL 2021 SIGMORPHON 2021 Shared Task on Morphological Reinflection: Generalization Across Languages ACL 2021 Disambiguatory Signals are Stronger in Word-initial Positions EACL 2021 Searching for Search Errors in Neural Morphological Inflection EACL 2021 Applying the Transformer to Character-level Transduction EACL 2021 Conditional Poisson Stochastic Beams EMNLP 2021 A surprisal–duration trade-off across and within the world’s languages EMNLP 2021 Revisiting the Uniform Information Density Hypothesis EMNLP 2021 A Bayesian Framework for Information-Theoretic Probing EMNLP 2021 Classifying Dyads for Militarized Conflict Analysis EMNLP 2021 On Homophony and Rényi Entropy EMNLP 2021 Efficient Sampling of Dependency Structure EMNLP 2021 Searching for More Efficient Dynamic Programs EMNLP 2021 A Plug-and-Play Method for Controlled Text Generation EMNLP 2021 Examining the Inductive Bias of Neural Language Models with Artificial Languages IJCNLP 2021 On Finding the K-best Non-projective Dependency Trees IJCNLP 2021 A Cognitive Regularizer for Language Modeling IJCNLP 2021 Language Model Evaluation Beyond Perplexity IJCNLP 2021 Determinantal Beam Search IJCNLP 2021 Is Sparse Attention more Interpretable? IJCNLP 2021 Higher-order Derivatives of Weighted Finite-state Machines IJCNLP 2021 Modeling the Unigram Distribution IJCNLP 2021 SIGMORPHON 2021 Shared Task on Morphological Reinflection: Generalization Across Languages IJCNLP 2021 A Non-Linear Structural Probe NAACL 2021 What About the Precedent: An Information-Theoretic Analysis of Common Law NAACL 2021 Finding Concept-specific Biases in Form–Meaning Associations NAACL 2021 How (Non-)Optimal is the Lexicon? NAACL 2021 SIGTYP 2021 Shared Task: Robust Spoken Language Identification NAACL 2021 Conditional Poisson Stochastic Beam Search EMNLP 2021 Efficient Sampling of Dependency Structures EMNLP 2021 SIGMORPHON 2020 Shared Task 0: Typologically Diverse Morphological Inflection ACL 2020 Morphologically Aware Word-Level Translation COLING 2020 If beam search is the answer, what was the question? EMNLP 2020 Exploring the Linear Subspace Hypothesis in Gender Bias Mitigation EMNLP 2020 Pareto Probing: Trading Off Accuracy for Complexity EMNLP 2020 Speakers Fill Lexical Semantic Gaps with Context EMNLP 2020 Investigating Cross-Linguistic Adjective Ordering Tendencies with a Latent-Variable Model EMNLP 2020 Please Mind the Root: Decoding Arborescences for Dependency Parsing EMNLP 2020 Measuring the Similarity of Grammatical Gender Systems by Comparing Partitions EMNLP 2020 SIGTYP 2020 Shared Task: Prediction of Typological Features EMNLP 2020 It’s Easier to Translate out of English than into it: Measuring Neural Translation Difficulty by Cross-Mutual Information ACL 2020 A Corpus for Large-Scale Phonetic Typology ACL 2020 Information-Theoretic Probing for Linguistic Structure ACL 2020 Predicting Declension Class from Form and Meaning ACL 2020 Intrinsic Probing through Dimension Selection EMNLP 2020 Generalized Entropy Regularization or: There’s Nothing Special about Label Smoothing ACL 2020 A Tale of a Probe and a Parser ACL 2020 The Paradigm Discovery Problem ACL 2020 Metaphor Detection using Context and Concreteness ACL 2020 What Kind of Language Is Hard to Language-Model? ACL 2019 Weird Inflects but OK: Making Sense of Morphological Generation Errors CONLL 2019 Don’t Forget the Long Tail! A Comprehensive Analysis of Morphological Generalization in Bilingual Lexicon Induction IJCNLP 2019 Towards Zero-shot Language Modeling IJCNLP 2019 It’s All in the Name: Mitigating Gender Bias with Name-Based Counterfactual Data Substitution IJCNLP 2019 Examining Gender Bias in Languages with Grammatical Gender IJCNLP 2019 Quantifying the Semantic Core of Gender Systems IJCNLP 2019 On the Distribution of Deep Clausal Embeddings: A Large Cross-linguistic Study ACL 2019 Uncovering Probabilistic Implications in Typological Knowledge Bases ACL 2019 Meaning to Form: Measuring Systematicity as Information ACL 2019 Unsupervised Discovery of Gendered Language through Latent-Variable Modeling ACL 2019 Counterfactual Data Augmentation for Mitigating Gender Stereotypes in Languages with Rich Morphology ACL 2019 Exact Hard Monotonic Attention for Character-Level Transduction ACL 2019 The SIGMORPHON 2019 Shared Task: Morphological Analysis in Context and Cross-Lingual Transfer for Inflection ACL 2019 Proceedings of the 16th Workshop on Computational Research in Phonetics, Phonology, and Morphology ACL 2019 Gender Bias in Contextualized Word Embeddings NAACL 2019 Combining Sentiment Lexica with a Multi-View Variational Autoencoder NAACL 2019 A Simple Joint Model for Improved Contextual Neural Lemmatization NAACL 2019 A Probabilistic Generative Model of Linguistic Typology NAACL 2019 Contextualization of Morphological Inflection NAACL 2019 On the Idiosyncrasies of the Mandarin Chinese Classifier System NAACL 2019 Rethinking Phonotactic Complexity ACL 2019 Proceedings of TyP-NLP: The First Workshop on Typology for Polyglot NLP ACL 2019 Morphological Irregularity Correlates with Frequency ACL 2019 Quantifying the Semantic Core of Gender Systems EMNLP 2019 Examining Gender Bias in Languages with Grammatical Gender EMNLP 2019 It’s All in the Name: Mitigating Gender Bias with Name-Based Counterfactual Data Substitution EMNLP 2019 Towards Zero-shot Language Modeling EMNLP 2019 Don’t Forget the Long Tail! A Comprehensive Analysis of Morphological Generalization in Bilingual Lexicon Induction EMNLP 2019 Are All Languages Equally Hard to Language-Model? NAACL 2018 Unsupervised Disambiguation of Syncretism in Inflected Lexicons NAACL 2018 A Structured Variational Autoencoder for Contextual Morphological Inflection ACL 2018 A Deep Generative Model of Vowel Formant Typology NAACL 2018 Marrying Universal Dependencies and Universal Morphology EMNLP 2018 A Discriminative Latent-Variable Model for Bilingual Lexicon Induction EMNLP 2018 Generalizing Procrustes Analysis for Better Bilingual Dictionary Induction CONLL 2018 Proceedings of the CoNLL–SIGMORPHON 2018 Shared Task: Universal Morphological Reinflection CONLL 2018 The CoNLL–SIGMORPHON 2018 Shared Task: Universal Morphological Reinflection CONLL 2018 Hard Non-Monotonic Attention for Character-Level Transduction EMNLP 2018 Context-Aware Prediction of Derivational Word-forms EACL 2017 Explaining and Generalizing Skip-Gram through Exponential Family Principal Component Analysis EACL 2017 Cross-lingual Character-Level Neural Morphological Tagging EMNLP 2017 Morphological Analysis of the Dravidian Language Family EACL 2017 Neural Graphical Models over Strings for Principal Parts Morphological Paradigm Completion EACL 2017 Paradigm Completion for Derivational Morphology EMNLP 2017 CoNLL-SIGMORPHON 2017 Shared Task: Universal Morphological Reinflection in 52 Languages CONLL 2017 Low-Resource Named Entity Recognition with Cross-lingual, Character-Level Neural Conditional Random Fields IJCNLP 2017 One-Shot Neural Cross-Lingual Transfer for Paradigm Completion ACL 2017 Probabilistic Typology: Deep Generative Models of Vowel Inventories ACL 2017 Neural Multi-Source Morphological Reinflection EACL 2017 A Rich Morphological Tagger for English: Exploring the Cross-Linguistic Tradeoff Between Morphology and Syntax EACL 2017 Morphological Segmentation Inside-Out EMNLP 2016 A Joint Model of Orthography and Morphological Segmentation NAACL 2016 Neural Morphological Analysis: Encoding-Decoding Canonical Segments EMNLP 2016 Morphological Smoothing and Extrapolation of Word Embeddings ACL 2016 Speed-Accuracy Tradeoffs in Tagging with Variable-Order CRFs and Structured Sparsity EMNLP 2016 Weighting Finite-State Transductions With Neural Context NAACL 2016 Dual Decomposition Inference for Graphical Models over Strings EMNLP 2015 Labeled Morphological Segmentation with Semi-Markov Models CONLL 2015 Penalized Expectation Propagation for Graphical Models over Strings NAACL 2015 Morphological Word-Embeddings NAACL 2015 Joint Lemmatization and Morphological Tagging with Lemming EMNLP 2015 Stochastic Contextual Edit Distance and Probabilistic FSTs ACL 2014