Emmanuel Dupoux

61 papers · 2008–2026 · 10 conferences · across top CS/AI conferences

Achievements

+13 more ↓

🧭 Keyword Pioneer 🌈 Renaissance Researcher (6) 🌉 Interdisciplinary Bridge 🗺️ Taxonomy Completionist (16) 🐣 Hot Topic Early Bird

🌉 Interdisciplinary Bridge 🌍 Conference Polyglot (10) 🗺️ Taxonomy Completionist (16) 🏠 Conference Loyalist (23) 🌟 Keyword Trendsetter Combo (4) 🔬 Deep Specialist (11) 🏆 Keyword Champion (3) 💎 Century Club (59) 🚀 Conference Pioneer 📈 Trend Setter ⚡ Prolific Year (10) 🔥 Unstoppable (13) 🗃️ Keyword Collector (206)

Conferences

INTERSPEECH (23) ACL (15) EMNLP (9) CONLL (3) NAACL (3) NIPS (3) ICLR (2) COLING (1) EACL (1) IJCNLP (1)

Top co-authors

Yossi Adi (9) Eugene Kharitonov (8) Jade Copet (8) Gabriel Synnaeve (8) Tu Anh Nguyen (8) Robin Algayres (7) Ewan Dunbar (7) Rahma Chaabouni (7) Wei-Ning Hsu (5) Benoît Sagot (5)

Research topics

Analysis (1) Linguistics (1)

Keywords

unsupervised learning (9) self-supervised learning (9) speech representation (7) emergent communication (6) speech synthesis (5) speech representation learning (4) discrete representation (4) language model (4) siamese network (4) neural network (4) speech resynthesis (3) representation learning (3) language acquisition (3) zipf law (3) referential game (3) spoken language modeling (3) word embedding (3) spoken language model (3) zero-shot learning (2) benchmark evaluation (2)

Papers

MauBERT: Universal Phonetic Inductive Biases for Few-Shot Acoustic Units Discovery ACL 2026 SpidR-Adapt: A Universal Speech Representation Model for Few-Shot Adaptation ACL 2026 Frequency & Compositionality in Emergent Communication EMNLP 2025 LongTail-Swap: benchmarking language models’ abilities on rare words EMNLP 2025 Improving Spoken Language Modeling with Phoneme Classification: A Simple Fine-tuning Approach EMNLP 2024 EmphAssess : a Prosodic Benchmark on Assessing Emphasis Transfer in Speech-to-Speech Models EMNLP 2024 Simulating articulatory trajectories with phonological feature interpolation INTERSPEECH 2024 Countering Reward Over-Optimization in LLM with Demonstration-Guided Reinforcement Learning ACL 2024 BabySLM: language-acquisition-friendly benchmark of self-supervised spoken language models INTERSPEECH 2023 Evaluating context-invariance in unsupervised speech representations INTERSPEECH 2023 ProsAudit, a prosodic benchmark for self-supervised speech models INTERSPEECH 2023 Neural Agents Struggle to Take Turns in Bidirectional Emergent Communication ICLR 2023 Textually Pretrained Speech Language Models NIPS 2023 Expresso: A Benchmark and Analysis of Discrete Expressive Speech Resynthesis INTERSPEECH 2023 Augmentation Invariant Discrete Representation for Generative Spoken Language Modeling ACL 2023 XLS-R fine-tuning on noisy word boundaries for unsupervised speech segmentation into words EMNLP 2023 Generative Spoken Language Model based on continuous word-sized audio tokens EMNLP 2023 Measuring Language Development From Child-centered Recordings INTERSPEECH 2023 Textless Speech Emotion Conversion using Discrete & Decomposed Representations EMNLP 2022 Emergent Communication: Generalization and Overfitting in Lewis Games NIPS 2022 Text-Free Prosody-Aware Generative Spoken Language Modeling ACL 2022 A comparison study on patient-psychologist voice diarization ACL 2022 On the role of population heterogeneity in emergent communication ICLR 2022 Probing phoneme, language and speaker information in unsupervised speech representations INTERSPEECH 2022 Speech Sequence Embeddings using Nearest Neighbors Contrastive Learning INTERSPEECH 2022 textless-lib: a Library for Textless Spoken Language Processing NAACL 2022 VoxPopuli: A Large-Scale Multilingual Speech Corpus for Representation Learning, Semi-Supervised Learning and Interpretation IJCNLP 2021 VoxPopuli: A Large-Scale Multilingual Speech Corpus for Representation Learning, Semi-Supervised Learning and Interpretation ACL 2021 Speech Resynthesis from Discrete Disentangled Self-Supervised Representations INTERSPEECH 2021 The Zero Resource Speech Challenge 2021: Spoken Language Modelling INTERSPEECH 2021 Analogies minus analogy test: measuring regularities in word embeddings CONLL 2020 “LazImpa”: Lazy and Impatient neural agents learn to communicate efficiently CONLL 2020 Vocal Markers from Sustained Phonation in Huntington’s Disease INTERSPEECH 2020 An Open-Source Voice Type Classifier for Child-Centered Daylong Recordings INTERSPEECH 2020 Evaluating the Reliability of Acoustic Speech Embeddings INTERSPEECH 2020 The Zero Resource Speech Challenge 2020: Discovering Discrete Subword and Word Units INTERSPEECH 2020 Compositionality and Generalization In Emergent Languages ACL 2020 Analogies minus analogy test: measuring regularities in word embeddings EMNLP 2020 “LazImpa”: Lazy and Impatient neural agents learn to communicate efficiently EMNLP 2020 Word-order Biases in Deep-agent Emergent Communication ACL 2019 Anti-efficient encoding in emergent communication NIPS 2019 The Zero Resource Speech Challenge 2019: TTS Without T INTERSPEECH 2019 End-to-End Speech Recognition from the Raw Waveform INTERSPEECH 2018 Learning Word Embeddings: Unsupervised Methods for Fixed-size Representations of Variable-length Speech Segments INTERSPEECH 2018 Sampling Strategies in Siamese Networks for Unsupervised Speech Representation Learning INTERSPEECH 2018 Comparing Character-level Neural Language Models Using a Lexical Decision Task EACL 2017 Blind Phoneme Segmentation With Temporal Prediction Errors ACL 2017 The Role of Prosody and Speech Register in Word Segmentation: A Computational Modelling Perspective ACL 2017 Predicting Epenthetic Vowel Quality from Acoustics INTERSPEECH 2017 Relating Unsupervised Word Segmentation to Reported Vocabulary Acquisition INTERSPEECH 2017 Learning Weakly Supervised Multimodal Phoneme Embeddings INTERSPEECH 2017 A Quantitative Measure of the Impact of Coarticulation on Phone Discriminability INTERSPEECH 2017 Joint Learning of Speaker and Phonetic Similarities with Siamese Networks INTERSPEECH 2016 Sign constraints on feature weights improve a joint model of word segmentation and phonology NAACL 2015 Prosodic boundary information helps unsupervised word segmentation NAACL 2015 Exploring the Relative Role of Bottom-up and Top-down Information in Phoneme Learning ACL 2014 Unsupervised Word Segmentation in Context COLING 2014 A Rudimentary Lexicon and Semantics Help Bootstrap Phoneme Acquisition CONLL 2014 Modelling function words improves unsupervised word segmentation ACL 2014 A corpus-based evaluation method for Distributional Semantic Models ACL 2013 Unsupervised Learning of Acoustic Sub-word Units ACL 2008