Co-occurring keywords
Papers
Rasa: Building Expressive Speech Synthesis Systems for Indian Languages in Low-resource Settings
INTERSPEECH 2024
Direct Speech Synthesis from Non-Invasive, Neuromagnetic Signals
INTERSPEECH 2024
Leveraging the Interplay between Syntactic and Acoustic Cues for Optimizing Korean TTS Pause Formation
COLING 2024
GTSinger: A Global Multi-Technique Singing Corpus with Realistic Music Scores for All Singing Tasks
NIPS 2024
Speaking in Wavelet Domain: A Simple and Efficient Approach to Speed up Speech Diffusion Model
EMNLP 2024
Seamless Language Expansion: Enhancing Multilingual Mastery in Self-Supervised Models
INTERSPEECH 2024
An Automated End-to-End Open-Source Software for High-Quality Text-to-Speech Dataset Generation
COLING 2024
PitchFlow: adding pitch control to a Flow-matching based TTS model
INTERSPEECH 2024
Well, what can you do with messy data? Exploring the prosody and pragmatic function of the discourse marker "well" with found data and speech synthesis
INTERSPEECH 2024
Production of phrases by mechanical models of the human vocal tract
INTERSPEECH 2024
Just Because We Camp, Doesn't Mean We Should: The Ethics of Modelling Queer Voices.
INTERSPEECH 2024
Evaluating Text-to-Speech Synthesis from a Large Discrete Token-based Speech Language Model
COLING 2024
CodecFake: Enhancing Anti-Spoofing Models Against Deepfake Audios from Codec-Based Speech Synthesis Systems
INTERSPEECH 2024
TunArTTS: Tunisian Arabic Text-To-Speech Corpus
COLING 2024