Thomas Drugman
17 papers · 2016–2023 · 2 conferences · across top CS/AI conferences
Achievements
Jump to papers ↓+9 more ↓ Show less ↑
π Renaissance Researcher (5) π Interdisciplinary Bridge πΊοΈ Taxonomy Completionist (11) π§ Keyword Pioneer π Conference Polyglot (2)
π
Renaissance Researcher
(5)
π
Conference Polyglot
(2)
π
Academic Marathon
(7)
π
Keyword Champion
(4)
ποΈ
Keyword Collector
(77)
π
Century Club
(17)
π₯
Unstoppable
(5)
π
Trend Setter
π
Conference Pioneer
Conferences
INTERSPEECH (16)
NAACL (1)
Top co-authors
Keywords
text-to-speech synthesis
(5)
prosody transfer
(4)
speech synthesis
(3)
variational autoencoder
(3)
fine-grained prosody
(3)
non-native speech
(2)
multi-speaker text-to-speech
(2)
speech naturalness
(2)
latent space
(2)
automatic speech recognition
(2)
attention mechanism
(2)
expressive speech
(2)
speaker-independent representation
(2)
multilabel classification
(1)
stratified sampling
(1)
semi-supervised training
(1)
speaker embedding
(1)
data augmentation
(1)
multi-task learning
(1)
prosody analysis
(1)
Papers
eCat: An End-to-End Model for Multi-Speaker TTS & Many-to-Many Fine-Grained Prosody Transfer
INTERSPEECH 2023
Expressive, Variable, and Controllable Duration Modelling in TTS
INTERSPEECH 2022
CopyCat2: A Single Model for Multi-Speaker TTS and Many-to-Many Fine-Grained Prosody Transfer
INTERSPEECH 2022
Simple and Effective Multi-sentence TTS with Expressive and Coherent Prosody
INTERSPEECH 2022
A Learned Conditional Prior for the VAE Acoustic Space of a TTS System
INTERSPEECH 2021
Weakly-Supervised Word-Level Pronunciation Error Detection in Non-Native English Speech
INTERSPEECH 2021
Detection of Lexical Stress Errors in Non-Native (L2) English with Data Augmentation and Attention
INTERSPEECH 2021
CopyCat: Many-to-Many Fine-Grained Prosody Transfer for Neural Text-to-Speech
INTERSPEECH 2020
Dynamic Prosody Generation for Speech Synthesis Using Linguistics-Driven Acoustic Embedding Selection
INTERSPEECH 2020
Singing Synthesis: With a Little Help from my Attention
INTERSPEECH 2020
In Other News: a Bi-style Text-to-speech Model for Synthesizing Newscaster Voice with Limited Data
NAACL 2019
Towards Achieving Robust Universal Neural Vocoding
INTERSPEECH 2019
Interpretable Deep Learning Model for the Detection and Reconstruction of Dysarthric Speech
INTERSPEECH 2019
Fine-Grained Robust Prosody Transfer for Single-Speaker Neural Text-To-Speech
INTERSPEECH 2019
Phrase Break Prediction for Long-Form Reading TTS: Exploiting Text Structure Information
INTERSPEECH 2017
Optimizing Speech Recognition Evaluation Using Stratified Sampling
INTERSPEECH 2016
Active and Semi-Supervised Learning in ASR: Benefits on the Acoustic and Language Models
INTERSPEECH 2016