Co-occurring keywords
Papers
When Is TTS Augmentation Through a Pivot Language Useful?
INTERSPEECH 2022
AdaVocoder: Adaptive Vocoder for Custom Voice
INTERSPEECH 2022
L2-GEN: A Neural Phoneme Paraphrasing Approach to L2 Speech Synthesis for Mispronunciation Diagnosis
INTERSPEECH 2022
Data Augmentation Using McAdams-Coefficient-Based Speaker Anonymization for Fake Audio Detection
INTERSPEECH 2022
Opencpop: A High-Quality Open Source Chinese Popular Song Corpus for Singing Voice Synthesis
INTERSPEECH 2022
DDOS: A MOS Prediction Framework utilizing Domain Adaptive Pre-training and Distribution of Opinion Scores
INTERSPEECH 2022
SingAug: Data Augmentation for Singing Voice Synthesis with Cycle-consistent Training Strategy
INTERSPEECH 2022
MSR-NV: Neural Vocoder Using Multiple Sampling Rates
INTERSPEECH 2022
TTS-by-TTS 2: Data-Selective Augmentation for Neural Speech Synthesis Using Ranking Support Vector Machine with Variational Autoencoder
INTERSPEECH 2022
Synthesizing Near Native-accented Speech for a Non-native Speaker by Imitating the Pronunciation and Prosody of a Native Speaker
INTERSPEECH 2022
Self supervised learning for robust voice cloning
INTERSPEECH 2022
Speaker Anonymization with Phonetic Intermediate Representations
INTERSPEECH 2022
Low-data? No problem: low-resource, language-agnostic conversational text-to-speech via F0-conditioned data augmentation
INTERSPEECH 2022
SOMOS: The Samsung Open MOS Dataset for the Evaluation of Neural Text-to-Speech Synthesis
INTERSPEECH 2022
Back to the Future: Extending the Blizzard Challenge 2013
INTERSPEECH 2022
REYD – The First Yiddish Text-to-Speech Dataset and System
INTERSPEECH 2022
Automatic Evaluation of Speaker Similarity
INTERSPEECH 2022