Papers
TSP-TTS: Text-based Style Predictor with Residual Vector Quantization for Expressive Text-to-Speech
INTERSPEECH 2024
JenGAN: Stacked Shifted Filters in GAN-Based Speech Synthesis
INTERSPEECH 2024
Enhancing Out-of-Vocabulary Performance of Indian TTS Systems for Practical Applications through Low-Effort Data Strategies
INTERSPEECH 2024
Towards Realistic Emotional Voice Conversion using Controllable Emotional Intensity
INTERSPEECH 2024
Speaking in Wavelet Domain: A Simple and Efficient Approach to Speed up Speech Diffusion Model
EMNLP 2024
TacoLM: GaTed Attention Equipped Codec Language Model are Efficient Zero-Shot Text to Speech Synthesizers
INTERSPEECH 2024
Just Because We Camp, Doesn't Mean We Should: The Ethics of Modelling Queer Voices
INTERSPEECH 2024
Controllable Generation of Artificial Speaker Embeddings through Discovery of Principal Directions
INTERSPEECH 2023
Towards Robust FastSpeech 2 by Modelling Residual Multimodality
INTERSPEECH 2023
ZET-Speech: Zero-shot adaptive Emotion-controllable Text-to-Speech Synthesis with Diffusion and Style-based Models
INTERSPEECH 2023
OverFlow: Putting flows on top of neural transducers for better TTS
INTERSPEECH 2023