Emmanuel Dupoux
61 papers · 2008–2026 · 10 conferences · across top CS/AI conferences
Achievements
Jump to papers ↓+13 more ↓ Show less ↑
🧭 Keyword Pioneer 🌈 Renaissance Researcher (6) 🌉 Interdisciplinary Bridge 🗺️ Taxonomy Completionist (16) 🐣 Hot Topic Early Bird
🌉
Interdisciplinary Bridge
🌍
Conference Polyglot
(10)
🗺️
Taxonomy Completionist
(16)
🏠
Conference Loyalist
(23)
🌟
Keyword Trendsetter Combo
(4)
🔬
Deep Specialist
(11)
🏆
Keyword Champion
(3)
💎
Century Club
(59)
🚀
Conference Pioneer
📈
Trend Setter
⚡
Prolific Year
(10)
🔥
Unstoppable
(13)
🗃️
Keyword Collector
(206)
Conferences
INTERSPEECH (23)
ACL (15)
EMNLP (9)
CONLL (3)
NAACL (3)
NIPS (3)
ICLR (2)
COLING (1)
EACL (1)
IJCNLP (1)
Top co-authors
Research topics
Keywords
unsupervised learning
(9)
self-supervised learning
(9)
speech representation
(7)
emergent communication
(6)
speech synthesis
(5)
speech representation learning
(4)
discrete representation
(4)
language model
(4)
siamese network
(4)
neural network
(4)
speech resynthesis
(3)
representation learning
(3)
language acquisition
(3)
zipf law
(3)
referential game
(3)
spoken language modeling
(3)
word embedding
(3)
spoken language model
(3)
zero-shot learning
(2)
benchmark evaluation
(2)
Papers
MauBERT: Universal Phonetic Inductive Biases for Few-Shot Acoustic Units Discovery
ACL 2026
SpidR-Adapt: A Universal Speech Representation Model for Few-Shot Adaptation
ACL 2026
Frequency & Compositionality in Emergent Communication
EMNLP 2025
LongTail-Swap: benchmarking language models’ abilities on rare words
EMNLP 2025
Improving Spoken Language Modeling with Phoneme Classification: A Simple Fine-tuning Approach
EMNLP 2024
EmphAssess : a Prosodic Benchmark on Assessing Emphasis Transfer in Speech-to-Speech Models
EMNLP 2024
Simulating articulatory trajectories with phonological feature interpolation
INTERSPEECH 2024
Countering Reward Over-Optimization in LLM with Demonstration-Guided Reinforcement Learning
ACL 2024
BabySLM: language-acquisition-friendly benchmark of self-supervised spoken language models
INTERSPEECH 2023
Evaluating context-invariance in unsupervised speech representations
INTERSPEECH 2023
ProsAudit, a prosodic benchmark for self-supervised speech models
INTERSPEECH 2023
Neural Agents Struggle to Take Turns in Bidirectional Emergent Communication
ICLR 2023
Textually Pretrained Speech Language Models
NIPS 2023
Expresso: A Benchmark and Analysis of Discrete Expressive Speech Resynthesis
INTERSPEECH 2023
Augmentation Invariant Discrete Representation for Generative Spoken Language Modeling
ACL 2023
XLS-R fine-tuning on noisy word boundaries for unsupervised speech segmentation into words
EMNLP 2023
Generative Spoken Language Model based on continuous word-sized audio tokens
EMNLP 2023
Measuring Language Development From Child-centered Recordings
INTERSPEECH 2023
Textless Speech Emotion Conversion using Discrete & Decomposed Representations
EMNLP 2022
Emergent Communication: Generalization and Overfitting in Lewis Games
NIPS 2022
Text-Free Prosody-Aware Generative Spoken Language Modeling
ACL 2022
A comparison study on patient-psychologist voice diarization
ACL 2022
On the role of population heterogeneity in emergent communication
ICLR 2022
Probing phoneme, language and speaker information in unsupervised speech representations
INTERSPEECH 2022
Speech Sequence Embeddings using Nearest Neighbors Contrastive Learning
INTERSPEECH 2022
textless-lib: a Library for Textless Spoken Language Processing
NAACL 2022
VoxPopuli: A Large-Scale Multilingual Speech Corpus for Representation Learning, Semi-Supervised Learning and Interpretation
IJCNLP 2021
VoxPopuli: A Large-Scale Multilingual Speech Corpus for Representation Learning, Semi-Supervised Learning and Interpretation
ACL 2021
Speech Resynthesis from Discrete Disentangled Self-Supervised Representations
INTERSPEECH 2021
The Zero Resource Speech Challenge 2021: Spoken Language Modelling
INTERSPEECH 2021
Analogies minus analogy test: measuring regularities in word embeddings
CONLL 2020
“LazImpa”: Lazy and Impatient neural agents learn to communicate efficiently
CONLL 2020
Vocal Markers from Sustained Phonation in Huntington’s Disease
INTERSPEECH 2020
An Open-Source Voice Type Classifier for Child-Centered Daylong Recordings
INTERSPEECH 2020
Evaluating the Reliability of Acoustic Speech Embeddings
INTERSPEECH 2020
The Zero Resource Speech Challenge 2020: Discovering Discrete Subword and Word Units
INTERSPEECH 2020
Compositionality and Generalization In Emergent Languages
ACL 2020
Analogies minus analogy test: measuring regularities in word embeddings
EMNLP 2020
“LazImpa”: Lazy and Impatient neural agents learn to communicate efficiently
EMNLP 2020
Word-order Biases in Deep-agent Emergent Communication
ACL 2019
Anti-efficient encoding in emergent communication
NIPS 2019
The Zero Resource Speech Challenge 2019: TTS Without T
INTERSPEECH 2019
End-to-End Speech Recognition from the Raw Waveform
INTERSPEECH 2018
Learning Word Embeddings: Unsupervised Methods for Fixed-size Representations of Variable-length Speech Segments
INTERSPEECH 2018
Sampling Strategies in Siamese Networks for Unsupervised Speech Representation Learning
INTERSPEECH 2018
Comparing Character-level Neural Language Models Using a Lexical Decision Task
EACL 2017
Blind Phoneme Segmentation With Temporal Prediction Errors
ACL 2017
The Role of Prosody and Speech Register in Word Segmentation: A Computational Modelling Perspective
ACL 2017
Predicting Epenthetic Vowel Quality from Acoustics
INTERSPEECH 2017
Relating Unsupervised Word Segmentation to Reported Vocabulary Acquisition
INTERSPEECH 2017
Learning Weakly Supervised Multimodal Phoneme Embeddings
INTERSPEECH 2017
A Quantitative Measure of the Impact of Coarticulation on Phone Discriminability
INTERSPEECH 2017
Joint Learning of Speaker and Phonetic Similarities with Siamese Networks
INTERSPEECH 2016
Sign constraints on feature weights improve a joint model of word segmentation and phonology
NAACL 2015
Prosodic boundary information helps unsupervised word segmentation
NAACL 2015
Exploring the Relative Role of Bottom-up and Top-down Information in Phoneme Learning
ACL 2014
Unsupervised Word Segmentation in Context
COLING 2014
A Rudimentary Lexicon and Semantics Help Bootstrap Phoneme Acquisition
CONLL 2014
Modelling function words improves unsupervised word segmentation
ACL 2014
A corpus-based evaluation method for Distributional Semantic Models
ACL 2013
Unsupervised Learning of Acoustic Sub-word Units
ACL 2008