Sharon Goldwater
63 papers · 2000–2025 · 10 conferences · across top CS/AI conferences
Achievements
Jump to papers ↓+14 more ↓ Show less ↑
π Academic Marathon (25) π§ Keyword Pioneer π Interdisciplinary Bridge π Conference Polyglot (10) π Cross-Pollinator (11)
π
Cross-Pollinator
(11)
π
Renaissance Researcher
(7)
πΊοΈ
Taxonomy Completionist
(61)
π±
Topic Pioneer
π
Keyword Champion
π§¬
Topic Evolution
π€
Dynamic Duo
(10)
π
Conference Pioneer
π
Trend Setter
π
Century Club
(63)
ποΈ
Keyword Collector
(171)
π₯
Unstoppable
(10)
β
The Questioner
(6)
β‘
Prolific Year
(5)
Conferences
ACL (15)
EMNLP (13)
NAACL (9)
EACL (8)
INTERSPEECH (6)
CONLL (4)
IJCNLP (3)
COLING (2)
JMLR (2)
NIPS (1)
Top co-authors
Research topics
Keywords
self-supervised learning
(5)
low-resource language
(4)
speech recognition
(4)
adaptor grammars
(3)
phonetic information
(3)
speech translation
(3)
text classification
(3)
dialect identification
(3)
spoken dialogue
(3)
speech representation
(2)
representation learning
(2)
word clustering
(2)
neural model
(2)
cross-linguistic analysis
(2)
unsupervised learning
(2)
disfluency detection
(2)
encoder-decoder architecture
(2)
pitman-yor process
(2)
neural encoder-decoder
(2)
pitch feature
(2)
Papers
The Cross-linguistic Role of Animacy in Grammar Structures
ACL 2025
A Grounded Typology of Word Classes
NAACL 2025
Revisiting Common Assumptions about Arabic Dialects in NLP
ACL 2025
Estimating the Level of Dialectness Predicts Inter-annotator Agreement in Multi-dialect Arabic Datasets
ACL 2024
Orthogonality and isotropy of speaker and phonetic information in self-supervised speech representations
INTERSPEECH 2024
Self-supervised speech representations display some human-like cross-linguistic perceptual abilities
EMNLP 2024
Self-supervised speech representations display some human-like cross-linguistic perceptual abilities
CONLL 2024
ALDi: Quantifying the Arabic Level of Dialectness of Text
EMNLP 2023
Self-supervised Predictive Coding Models Encode Speaker and Phonetic Information in Orthogonal Subspaces
INTERSPEECH 2023
Parsing dialog turns with prosodic features in English
INTERSPEECH 2023
Acoustic Word Embeddings for Untranscribed Target Languages with Continued Pretraining and Learned Pooling
INTERSPEECH 2023
Language-Agnostic Measures Discriminate Inflection and Derivation
EACL 2023
Adaptor Grammars for Unsupervised Paradigm Clustering
ACL 2021
On the Difficulty of Segmenting Words with Attention
EMNLP 2021
[RETRACTED] Prosodic segmentation for parsing spoken dialogue
IJCNLP 2021
Adaptor Grammars for Unsupervised Paradigm Clustering
IJCNLP 2021
A phonetic model of non-native spoken word processing
EACL 2021
[RETRACTED] Prosodic segmentation for parsing spoken dialogue
ACL 2021
The role of context in neural pitch accent detection in English
EMNLP 2020
Conditioning, but on Which Distribution? Grammatical Gender in German Plural Inflection
EMNLP 2020
Inflecting When Thereβs No Majority: Limitations of Encoder-Decoder Neural Networks as Cognitive Models for German Plurals
ACL 2020
Data Augmentation for Context-Sensitive Neural Lemmatization Using Inflection Tables and Raw Text
NAACL 2019
Pre-training on high-resource speech recognition improves low-resource speech-to-text translation
NAACL 2019
Are we there yet? Encoder-decoder neural networks as cognitive models of English past tense inflection
ACL 2019
Multilingual Bottleneck Features for Subword Modeling in Zero-resource Languages
INTERSPEECH 2018
Context Sensitive Neural Lemmatization with Lematus
NAACL 2018
Evaluating Historical Text Normalization Systems: How Well Do They Generalize?
NAACL 2018
Inducing a lexicon of sociolinguistic variables from code-mixed text
EMNLP 2018
Low-Resource Speech-to-Text Translation
INTERSPEECH 2018
From Segmentation to Analyses: a Probabilistic Model for Unsupervised Morphology Induction
EACL 2017
Aye or naw, whit dae ye hink? Scottish independence and linguistic identity on social media
EACL 2017
Towards speech-to-text translation without speech recognition
EACL 2017
Training Data Augmentation for Low-Resource Morphological Inflection
CONLL 2017
Proceedings of the 14th Conference of the European Chapter of the Association for Computational Linguistics, volume 2: Short Papers
EACL 2014
Weak semantic context helps phonetic learning in a model of infant language acquisition
ACL 2014
POS induction with distributional and morphological information using a distance-dependent Chinese restaurant process
ACL 2014
Proceedings of the 14th Conference of the European Chapter of the Association for Computational Linguistics
EACL 2014
Exploring the Utility of Joint Morphological and Syntactic Learning from Child-directed Speech
EMNLP 2013
A Joint Learning Model of Word Segmentation, Lexical Acquisition, and Phonetic Variability
EMNLP 2013
Semantic Parsing with Bayesian Tree Transducers
ACL 2012
Bootstrapping a Unified Model of Lexical and Phonetic Acquisition
ACL 2012
Turning the pipeline into a loop: Iterated unsupervised dependency parsing and PoS induction
NAACL 2012
A Probabilistic Model of Syntactic and Semantic Acquisition from Child-Directed Utterances and their Meanings
EACL 2012
Proceedings of the Fifteenth Conference on Computational Natural Language Learning
CONLL 2011
A Bayesian Mixture Model for PoS Induction Using Multiple Features
EMNLP 2011
Lexical Generalization in CCG Grammar Induction for Semantic Parsing
EMNLP 2011
Producing Power-Law Distributions and Damping Word Frequencies with Two-Stage Language Models
JMLR 2011
Two Decades of Unsupervised POS Induction: How Far Have We Come?
EMNLP 2010
Inducing Tree-Substitution Grammars
JMLR 2010
Inducing Probabilistic CCG Grammars from Logical Form with Higher-Order Unification
EMNLP 2010
Inducing Compact but Accurate Tree-Substitution Grammars
NAACL 2009
A Note on the Implementation of Hierarchical Dirichlet Processes
ACL 2009
A Note on the Implementation of Hierarchical Dirichlet Processes
IJCNLP 2009
Improving nonparameteric Bayesian inference: experiments on unsupervised word segmentation with adaptor grammars
NAACL 2009
Which Words Are Hard to Recognize? Prosodic, Lexical, and Disfluency Factors that Increase ASR Error Rates
ACL 2008
Bayesian Inference for PCFGs via Markov Chain Monte Carlo
NAACL 2007
A fully Bayesian approach to unsupervised part-of-speech tagging
ACL 2007
Contextual Dependencies in Unsupervised Word Segmentation
ACL 2006
Adaptor Grammars: A Framework for Specifying Compositional Nonparametric Bayesian Models
NIPS 2006
Contextual Dependencies in Unsupervised Word Segmentation
COLING 2006
Representational Bias in Unsupervised Learning of Syllable Structure
CONLL 2005
Improving Statistical MT through Morphological Analysis
EMNLP 2005
Compiling Language Models from a Linguistically Motivated Unification Grammar
COLING 2000