Papers
Humanizing bionic voice: interactive demonstration of aesthetic design and control factors influencing the devices assembly and waveshape engineering
INTERSPEECH 2022
Low-data? No problem: low-resource, language-agnostic conversational text-to-speech via F0-conditioned data augmentation
INTERSPEECH 2022
TTS-by-TTS 2: Data-Selective Augmentation for Neural Speech Synthesis Using Ranking Support Vector Machine with Variational Autoencoder
INTERSPEECH 2022
From Start to Finish: Latency Reduction Strategies for Incremental Speech Synthesis in Simultaneous Speech-to-Speech Translation
INTERSPEECH 2022
Towards Efficiently Learning Monotonic Alignments for Attention-based End-to-End Speech Recognition
INTERSPEECH 2022
Fine-grained Noise Control for Multispeaker Speech Synthesis
INTERSPEECH 2022
Data Augmentation Using McAdams-Coefficient-Based Speaker Anonymization for Fake Audio Detection
INTERSPEECH 2022
Normalization of code-switched text for speech synthesis
INTERSPEECH 2022
Relationship between the acoustic time intervals and tongue movements of German diphthongs
INTERSPEECH 2022
SpecGrad: Diffusion Probabilistic Model based Neural Vocoder with Adaptive Noise Spectral Shaping
INTERSPEECH 2022
SiD-WaveFlow: A Low-Resource Vocoder Independent of Prior Knowledge
INTERSPEECH 2022
Towards Multi-Scale Speaking Style Modelling with Hierarchical Context Information for Mandarin Speech Synthesis
INTERSPEECH 2022
Combining conversational speech with read speech to improve prosody in Text-to-Speech synthesis
INTERSPEECH 2022