David Yarowsky
70 papers · 2000–2025 · 9 conferences · across top CS/AI conferences
Achievements
Jump to papers ↓+13 more ↓ Show less ↑
π Conference Polyglot (9) π Academic Marathon (25) π Interdisciplinary Bridge π§ Keyword Pioneer π Cross-Pollinator (12)
π
Renaissance Researcher
(8)
π
Cross-Pollinator
(12)
π
Conference Polyglot
(9)
π
Conference Loyalist
(20)
π€
Dynamic Duo
(10)
π₯
Mega-Team
(58)
π§¬
Topic Evolution
π
Keyword Champion
(3)
ποΈ
Keyword Collector
(111)
π
Conference Pioneer
β‘
Prolific Year
(5)
π₯
Unstoppable
(10)
π
Century Club
(70)
Conferences
ACL (20)
IJCNLP (12)
EMNLP (11)
NAACL (10)
CONLL (8)
COLING (5)
EACL (2)
AACL (1)
SEMEVAL (1)
Top co-authors
Research topics
Keywords
low-resource language
(5)
word formation
(3)
cross-lingual generalization
(3)
machine translation
(3)
large language model
(3)
multilingual nlp
(2)
color terminology
(2)
morphological analysis
(2)
typological diversity
(2)
computational linguistics
(2)
morphological reinflection
(2)
neural network
(2)
cross-linguistic analysis
(2)
basic color term
(2)
multilingual generation
(2)
transformer model
(2)
multilingual model
(2)
universal dependencies
(1)
cross-lingual embedding
(1)
text generation
(1)
Papers
JHUβs Submission to the AmericasNLP 2025 Shared Task on the Creation of Educational Materials for Indigenous Languages
NAACL 2025
The Translation Barrier Hypothesis: Multilingual Generation with Large Language Models Suffers from Implicit Translation Failure
IJCNLP 2025
The Translation Barrier Hypothesis: Multilingual Generation with Large Language Models Suffers from Implicit Translation Failure
AACL 2025
DialUp! Modeling the Language Continuum by Adapting Models to Dialects and Dialects to Models
ACL 2025
JHU IWSLT 2025 Low-resource System Description
ACL 2025
Pointer-Generator Networks for Low-Resource Machine Translation: Donβt Copy That!
NAACL 2024
Evaluating Large Language Models along Dimensions of Language Variation: A Systematik Invesdigatiom uv Cross-lingual Generalization
EMNLP 2024
Deciphering and Characterizing Out-of-Vocabulary Words for Morphologically Rich Languages
COLING 2022
Known Words Will Do: Unknown Concept Translation via Lexical Relations
COLING 2022
Sequence Models for Computational Etymology of Borrowings
ACL 2021
SIGMORPHON 2021 Shared Task on Morphological Reinflection: Generalization Across Languages
IJCNLP 2021
Sequence Models for Computational Etymology of Borrowings
IJCNLP 2021
SIGMORPHON 2021 Shared Task on Morphological Reinflection: Generalization Across Languages
ACL 2021
Induced Inflection-Set Keyword Search in Speech
ACL 2020
Neural Transduction for Multilingual Lexical Translation
COLING 2020
Wiktionary Normalization of Translations and Morphological Information
COLING 2020
Measuring the Similarity of Grammatical Gender Systems by Comparing Partitions
EMNLP 2020
Massively Multilingual Adversarial Speech Recognition
NAACL 2019
Modeling Color Terminology Across Thousands of Languages
EMNLP 2019
Learning Morphosyntactic Analyzers from the Bible via Iterative Annotation Projection across 26 Languages
ACL 2019
Modeling Color Terminology Across Thousands of Languages
IJCNLP 2019
Marrying Universal Dependencies and Universal Morphology
EMNLP 2018
The CoNLLβSIGMORPHON 2018 Shared Task: Universal Morphological Reinflection
CONLL 2018
Deriving Consensus for Multi-Parallel Corpora: an English Bible Study
IJCNLP 2017
Paradigm Completion for Derivational Morphology
EMNLP 2017
CoNLL-SIGMORPHON 2017 Shared Task: Universal Morphological Reinflection in 52 Languages
CONLL 2017
Cross-lingual Dependency Parsing Based on Distributed Representations
ACL 2015
A Language-Independent Feature Schema for Inflectional Morphology
ACL 2015
Cross-lingual Dependency Parsing Based on Distributed Representations
IJCNLP 2015
Social Media Predictive Analytics
NAACL 2015
A Language-Independent Feature Schema for Inflectional Morphology
IJCNLP 2015
Exploring Sentiment in Social Media: Bootstrapping Subjectivity Clues from Multilingual Twitter Streams
ACL 2013
Proceedings of the 2013 Conference on Empirical Methods in Natural Language Processing
EMNLP 2013
Broadly Improving User Classification via Communication-Based Name and Location Clustering on Twitter
NAACL 2013
Exploring Demographic Language Variations to Improve Multilingual Sentiment Analysis in Social Media
EMNLP 2013
Toward Statistical Machine Translation without Parallel Corpora
EACL 2012
Stylometric Analysis of Scientific Articles
NAACL 2012
Proceedings of 5th International Joint Conference on Natural Language Processing
IJCNLP 2011
Using Large Monolingual and Bilingual Corpora to Improve Coordination Disambiguation
ACL 2011
Typed Graph Models for Learning Latent Attributes from Names
ACL 2011
Modeling Latent Biographic Attributes in Conversational Genres
ACL 2009
Arabic Cross-Document Coreference Resolution
IJCNLP 2009
Improving Translation Lexicon Induction from Monolingual Corpora via Dependency Contexts and Part-of-Speech Equivalences
CONLL 2009
Structural, Transitive and Latent Models for Biographic Fact Extraction
EACL 2009
Modeling Latent Biographic Attributes in Conversational Genres
IJCNLP 2009
Arabic Cross-Document Coreference Resolution
ACL 2009
Unsupervised Translation Induction for Chinese Abbreviations using Monolingual Corpora
ACL 2008
Translating Compounds by Learning Component Gloss Translation Models via Multiple Languages
IJCNLP 2008
Minimally Supervised Multilingual Taxonomy and Translation Lexicon Induction
IJCNLP 2008
Mining and Modeling Relations between Formal and Informal Chinese Phrases from Web Corpora
EMNLP 2008
JHU1 : An Unsupervised Approach to Person Name Disambiguation using Web Snippets
SEMEVAL 2007
Resolving and Generating Definite Anaphora by Modeling Hypernymy using Unlabeled Corpora
CONLL 2006
Multi-Field Information Extraction and Cross-Document Fusion
ACL 2005
Improving Bitext Word Alignments via Syntax-based Reordering of English
ACL 2004
Exploiting Aggregate Properties of Bilingual Dictionaries For Distinguishing Senses of English Words and Inducing English Sense Clusters
ACL 2004
Statistical Machine Translation Using Coercive Two-Level Syntactic Transduction
EMNLP 2003
Unsupervised Personal Name Disambiguation
CONLL 2003
Desparately Seeking Cebuano
NAACL 2003
Minimally Supervised Induction of Grammatical Gender
NAACL 2003
Modeling Consensus: Classifier Combination for Word Sense Disambiguation
EMNLP 2002
Inducing Translation Lexicons via Diverse Similarity Measures and Bridge Languages
CONLL 2002
Language Independent NER using a Unified Model of Internal and Contextual Evidence
CONLL 2002
Bootstrapping a Multilingual Part-of-speech Tagger in One Person-day
CONLL 2002
Inducing Information Extraction Systems for New Languages via Cross-language Projection
COLING 2002
Augmented Mixture Models for Lexical Disambiguation
EMNLP 2002
Multipath Translation Lexicon Induction via Bridge Languages
NAACL 2001
Inducing Multilingual POS Taggers and NP Bracketers via Robust Projection Across Aligned Corpora
NAACL 2001
Language Independent, Minimally Supervised Induction of Lexical Probabilities
ACL 2000
Minimally Supervised Morphological Analysis by Multimodal Alignment
ACL 2000
Rule Writing or Annotation: Cost-efficient Resource Usage for Base Noun Phrase Chunking
ACL 2000