Co-occurring keywords
Papers
MaskedSpeech: Context-aware Speech Synthesis with Masking Strategy
INTERSPEECH 2023
CLAPSpeech: Learning Prosody from Text Context with Contrastive Language-Audio Pre-Training
ACL 2023
FastDiff 2: Revisiting and Incorporating GANs and Diffusion Models in High-Fidelity Speech Synthesis
ACL 2023
Iteratively Improving Speech Recognition and Voice Conversion
INTERSPEECH 2023
The Role of Formant and Excitation Source Features in Perceived Naturalness of Low Resource Tribal Language TTS: An Empirical Study
INTERSPEECH 2023
FluentSpeech: Stutter-Oriented Automatic Speech Editing with Context-Aware Diffusion Models
ACL 2023
Decoupling Segmental and Prosodic Cues of Non-native Speech through Vector Quantization
INTERSPEECH 2023
DualVC: Dual-mode Voice Conversion using Intra-model Knowledge Distillation and Hybrid Predictive Coding
INTERSPEECH 2023
Streaming Parrotron for on-device speech-to-speech conversion
INTERSPEECH 2023
EdenTTS: A Simple and Efficient Parallel Text-to-speech Architecture with Collaborative Duration-alignment Learning
INTERSPEECH 2023
ChatGPT-EDSS: Empathetic Dialogue Speech Synthesis Trained from ChatGPT-derived Context Word Embeddings
INTERSPEECH 2023
Speech inpainting: Context-based speech synthesis guided by video
INTERSPEECH 2023
ClArTTS: An Open-Source Classical Arabic Text-to-Speech Corpus
INTERSPEECH 2023