Thomas Drugman

17 papers · 2016–2023 · 2 conferences · across top CS/AI conferences

Achievements

+9 more ↓

🌈 Renaissance Researcher (5) 🌉 Interdisciplinary Bridge 🗺️ Taxonomy Completionist (11) 🧭 Keyword Pioneer 🌍 Conference Polyglot (2)

🌈 Renaissance Researcher (5) 🌍 Conference Polyglot (2) 🏃 Academic Marathon (7) 🏆 Keyword Champion (4) 🗃️ Keyword Collector (77) 💎 Century Club (17) 🔥 Unstoppable (5) 📈 Trend Setter 🚀 Conference Pioneer

Conferences

INTERSPEECH (16) NAACL (1)

Top co-authors

Alexis Moinet (9) Jaime Lorenzo-Trueba (6) Sri Karlapati (6) Roberto Barra-Chicote (5) Penny Karanasou (4) Thomas Merritt (4) Daniel Korzekwa (3) Viacheslav Klimkov (3) Arnaud Joly (3) Bozena Kostek (3)

Keywords

text-to-speech synthesis (5) prosody transfer (4) speech synthesis (3) variational autoencoder (3) fine-grained prosody (3) non-native speech (2) multi-speaker text-to-speech (2) speech naturalness (2) latent space (2) automatic speech recognition (2) attention mechanism (2) expressive speech (2) speaker-independent representation (2) multilabel classification (1) stratified sampling (1) semi-supervised training (1) speaker embedding (1) data augmentation (1) multi-task learning (1) prosody analysis (1)

Papers

eCat: An End-to-End Model for Multi-Speaker TTS & Many-to-Many Fine-Grained Prosody Transfer INTERSPEECH 2023 Expressive, Variable, and Controllable Duration Modelling in TTS INTERSPEECH 2022 CopyCat2: A Single Model for Multi-Speaker TTS and Many-to-Many Fine-Grained Prosody Transfer INTERSPEECH 2022 Simple and Effective Multi-sentence TTS with Expressive and Coherent Prosody INTERSPEECH 2022 A Learned Conditional Prior for the VAE Acoustic Space of a TTS System INTERSPEECH 2021 Weakly-Supervised Word-Level Pronunciation Error Detection in Non-Native English Speech INTERSPEECH 2021 Detection of Lexical Stress Errors in Non-Native (L2) English with Data Augmentation and Attention INTERSPEECH 2021 CopyCat: Many-to-Many Fine-Grained Prosody Transfer for Neural Text-to-Speech INTERSPEECH 2020 Dynamic Prosody Generation for Speech Synthesis Using Linguistics-Driven Acoustic Embedding Selection INTERSPEECH 2020 Singing Synthesis: With a Little Help from my Attention INTERSPEECH 2020 In Other News: a Bi-style Text-to-speech Model for Synthesizing Newscaster Voice with Limited Data NAACL 2019 Towards Achieving Robust Universal Neural Vocoding INTERSPEECH 2019 Interpretable Deep Learning Model for the Detection and Reconstruction of Dysarthric Speech INTERSPEECH 2019 Fine-Grained Robust Prosody Transfer for Single-Speaker Neural Text-To-Speech INTERSPEECH 2019 Phrase Break Prediction for Long-Form Reading TTS: Exploiting Text Structure Information INTERSPEECH 2017 Optimizing Speech Recognition Evaluation Using Stratified Sampling INTERSPEECH 2016 Active and Semi-Supervised Learning in ASR: Benefits on the Acoustic and Language Models INTERSPEECH 2016