Roberto Barra-Chicote

16 papers · 2016–2023 · 4 top CS/AI conferences

Achievements

🌍 Conference Polyglot (4) 🌉 Interdisciplinary Bridge 🌈 Renaissance Researcher (5) 🧭 Keyword Pioneer 🏃 Academic Marathon (7)

🐝 Cross-Pollinator (8) 🗺️ Taxonomy Completionist (23) 🏆 Keyword Champion (2) 🔥 Unstoppable (5) 🗃️ Keyword Collector (59) 💎 Century Club (16) 📈 Trend Setter ⚡ Prolific Year (5)

Conferences

INTERSPEECH (13) ACL (1) COLING (1) NAACL (1)

Top co-authors

Thomas Merritt (6) Thomas Drugman (5) Daniel Korzekwa (5) Jaime Lorenzo-Trueba (5) Robert Enyedi (4) Marcello Federico (4) Jasha Droppo (3) Grzegorz Beringer (3) Yogesh Virkar (3) Abdelhamid Ezzerg (3)

Keywords

normalizing flow (5) speech synthesis (4) automatic dubbing (4) prosodic alignment (3) data augmentation (2) acoustic modeling (2) voice conversion (2) neural vocoding (2) text-to-speech synthesis (2) neural text-to-speech (2) speech-to-speech translation (2) speaker embedding (2) prosody modeling (2) probability density (1) automatic speech recognition (1) zero-shot learning (1) prosody analysis (1) latent space (1) non-native speech (1) attention mechanism (1)

Papers

Comparing normalizing flows and diffusion models for prosody and acoustic modelling in text-to-speech · INTERSPEECH 2023
GlowVC: Mel-spectrogram space disentangling model for language-independent text-free voice conversion · INTERSPEECH 2022
Creating New Voices using Normalizing Flows · INTERSPEECH 2022
Prosodic alignment for off-screen automatic dubbing · INTERSPEECH 2022
Intra-Sentential Speaking Rate Control in Neural Text-To-Speech for Automatic Dubbing · INTERSPEECH 2021
Improving Multi-Speaker TTS Prosody Variance with a Residual Encoder and Normalizing Flows · INTERSPEECH 2021
Detection of Lexical Stress Errors in Non-Native (L2) English with Data Augmentation and Attention · INTERSPEECH 2021
SynthASR: Unlocking Synthetic Data for Speech Recognition · INTERSPEECH 2021
Improving the Expressiveness of Neural Vocoding with Non-Affine Normalizing Flows · INTERSPEECH 2021
From Speech-to-Speech Translation to Automatic Dubbing · ACL 2020
Evaluating and Optimizing Prosodic Alignment for Automatic Dubbing · INTERSPEECH 2020
Interpretable Deep Learning Model for the Detection and Reconstruction of Dysarthric Speech · INTERSPEECH 2019
Towards Achieving Robust Universal Neural Vocoding · INTERSPEECH 2019
In Other News: a Bi-style Text-to-speech Model for Synthesizing Newscaster Voice with Limited Data · NAACL 2019
Phrase Break Prediction for Long-Form Reading TTS: Exploiting Text Structure Information · INTERSPEECH 2017
Continuous Expressive Speaking Styles Synthesis based on CVSM and MR-HMM · COLING 2016