Roberto Barra-Chicote
16 papers · 2016–2023 · 4 conferences · across top CS/AI conferences
Achievements
Jump to papers ↓+9 more ↓ Show less ↑
π Conference Polyglot (4) π Interdisciplinary Bridge π Renaissance Researcher (5) π§ Keyword Pioneer π Academic Marathon (7)
π
Academic Marathon
(7)
π
Cross-Pollinator
(8)
πΊοΈ
Taxonomy Completionist
(23)
π
Keyword Champion
(2)
π₯
Unstoppable
(5)
ποΈ
Keyword Collector
(59)
π
Century Club
(16)
π
Trend Setter
β‘
Prolific Year
(5)
Conferences
INTERSPEECH (13)
ACL (1)
COLING (1)
NAACL (1)
Top co-authors
Keywords
normalizing flow
(5)
speech synthesis
(4)
automatic dubbing
(4)
prosodic alignment
(3)
data augmentation
(2)
acoustic modeling
(2)
voice conversion
(2)
neural vocoding
(2)
text-to-speech synthesis
(2)
neural text-to-speech
(2)
speech-to-speech translation
(2)
speaker embedding
(2)
prosody modeling
(2)
probability density
(1)
automatic speech recognition
(1)
zero-shot learning
(1)
prosody analysis
(1)
latent space
(1)
non-native speech
(1)
attention mechanism
(1)
Papers
Comparing normalizing flows and diffusion models for prosody and acoustic modelling in text-to-speech
INTERSPEECH 2023
GlowVC: Mel-spectrogram space disentangling model for language-independent text-free voice conversion
INTERSPEECH 2022
Creating New Voices using Normalizing Flows
INTERSPEECH 2022
Prosodic alignment for off-screen automatic dubbing
INTERSPEECH 2022
Intra-Sentential Speaking Rate Control in Neural Text-To-Speech for Automatic Dubbing
INTERSPEECH 2021
Improving Multi-Speaker TTS Prosody Variance with a Residual Encoder and Normalizing Flows
INTERSPEECH 2021
Detection of Lexical Stress Errors in Non-Native (L2) English with Data Augmentation and Attention
INTERSPEECH 2021
SynthASR: Unlocking Synthetic Data for Speech Recognition
INTERSPEECH 2021
Improving the Expressiveness of Neural Vocoding with Non-Affine Normalizing Flows
INTERSPEECH 2021
From Speech-to-Speech Translation to Automatic Dubbing
ACL 2020
Evaluating and Optimizing Prosodic Alignment for Automatic Dubbing
INTERSPEECH 2020
Interpretable Deep Learning Model for the Detection and Reconstruction of Dysarthric Speech
INTERSPEECH 2019
Towards Achieving Robust Universal Neural Vocoding
INTERSPEECH 2019
In Other News: a Bi-style Text-to-speech Model for Synthesizing Newscaster Voice with Limited Data
NAACL 2019
Phrase Break Prediction for Long-Form Reading TTS: Exploiting Text Structure Information
INTERSPEECH 2017
Continuous Expressive Speaking Styles Synthesis based on CVSM and MR-HMM
COLING 2016