David R. Mortensen
37 papers · 2016–2026 · 7 conferences · across top CS/AI conferences
Achievements
Conference Polyglot (7)
Interdisciplinary Bridge
Taxonomy Completionist (16)
Keyword Pioneer
Academic Marathon (9)
Deep Specialist (10)
Keyword Collector (167)
Prolific Year (6)
Century Club (32)
Unstoppable (10)
Trend Setter
The Questioner
Conferences
ACL (10)
COLING (9)
EMNLP (6)
INTERSPEECH (6)
EACL (4)
AACL (1)
IJCNLP (1)
Research topics
Keywords
low-resource language (11)
machine translation (7)
large language model (6)
automatic speech recognition (5)
phone recognition (4)
named entity recognition (4)
transfer learning (4)
emergent language (3)
multilingual speech (3)
protoform reconstruction (3)
cross-lingual transfer (3)
speech recognition (3)
speech synthesis (2)
data augmentation (2)
natural language inference (2)
interlinear glossing (2)
multilingual generation (2)
multilingual model (2)
synthetic data generation (2)
representation learning (2)
Papers
Communicating in Emergent Language with an Induced Morphological Phrasebook
ACL 2026
PRiSM: Benchmarking Phone Realization in Speech Models
ACL 2026
POWSM: A Phonetic Open Whisper-Style Speech Foundation Model
ACL 2026
Happiness is Sharing a Vocabulary: A Study of Transliteration Methods
EACL 2026
From sunblock to softblock: Analyzing the correlates of neology in published writing and on social media
EACL 2026
The Translation Barrier Hypothesis: Multilingual Generation with Large Language Models Suffers from Implicit Translation Failure
AACL 2025
ZIPA: A family of efficient models for multilingual phone recognition
ACL 2025
DialUp! Modeling the Language Continuum by Adapting Models to Dialects and Dialects to Models
ACL 2025
Programming by Example meets Historical Linguistics: A Large Language Model Based Approach to Sound Law Induction
ACL 2025
Searching for the Most Human-like Emergent Language
EMNLP 2025
Morpheme Induction for Emergent Language
EMNLP 2025
The Translation Barrier Hypothesis: Multilingual Generation with Large Language Models Suffers from Implicit Translation Failure
IJCNLP 2025
Constructions Are So Difficult That Even Large Language Models Get Them Right for the Wrong Reasons
COLING 2024
Improved Neural Protoform Reconstruction via Reflex Prediction
COLING 2024
Phonotactic Complexity across Dialects
COLING 2024
PWESuite: Phonetic Word Embeddings and Tasks They Facilitate
COLING 2024
Verbing Weirds Language (Models): Evaluation of English Zero-Derivation in Five LLMs
COLING 2024
Self-supervised Speech Representations Still Struggle with African American Vernacular English
INTERSPEECH 2024
Transformed Protoform Reconstruction
ACL 2023
ChatGPT MT: Competitive for High- (but Not Low-) Resource Languages
EMNLP 2023
Generalized Glossing Guidelines: An Explicit, Human- and Machine-Readable, Item-and-Process Convention for Morphological Annotation
ACL 2023
SigMoreFun Submission to the SIGMORPHON Shared Task on Interlinear Glossing
ACL 2023
Data-adaptive Transfer Learning for Translation: A Case Study in Haitian and Jamaican
COLING 2022
WikiHan: A New Comparative Dataset for Chinese Languages
COLING 2022
When Is TTS Augmentation Through a Pivot Language Useful?
INTERSPEECH 2022
ASR2K: Speech Recognition for Around 2000 Languages without Audio
INTERSPEECH 2022
Tusom2021: A Phonetically Transcribed Speech Dataset from an Endangered Language for Universal Phone Recognition Experiments
INTERSPEECH 2021
Evaluating the Morphosyntactic Well-formedness of Generated Texts
EMNLP 2021
Cross-Cultural Similarity Features for Cross-Lingual Transfer Learning of Pragmatically Motivated Tasks
EACL 2021
Phoneme Recognition Through Fine Tuning of Phonetic Representations: A Case Study on Luhya Language Varieties
INTERSPEECH 2021
Differentiable Allophone Graphs for Language-Universal Speech Recognition
INTERSPEECH 2021
Automatic Extraction of Rules Governing Morphological Agreement
EMNLP 2020
CMU-01 at the SIGMORPHON 2019 Shared Task on Crosslinguality and Context in Morphology
ACL 2019
Adapting Word Embeddings to New Languages with Morphological and Phonological Subword Representations
EMNLP 2018
URIEL and lang2vec: Representing languages as typological, geographical, and phylogenetic vectors
EACL 2017
Named Entity Recognition for Linguistic Rapid Response in Low-Resource Languages: Sorani Kurdish and Tajik
COLING 2016
PanPhon: A Resource for Mapping IPA Segments to Articulatory Feature Vectors
COLING 2016