conftrace_

speech synthesis

757 papers

Explore in graph

Also known as

SSS SS

Co-occurring keywords

neural vocoder (126) voice conversion (259) text-to-speech synthesis (294) speech recognition (1226) deep neural network (1803) speech generation (98) low-resource language (2273) automatic speech recognition (1774) generative adversarial network (1944) neural network (6616)

Papers

Humanizing bionic voice: interactive demonstration of aesthetic design and control factors influencing the devices assembly and waveshape engineering INTERSPEECH 2022

Low-data? No problem: low-resource, language-agnostic conversational text-to-speech via F0-conditioned data augmentation INTERSPEECH 2022

TTS-by-TTS 2: Data-Selective Augmentation for Neural Speech Synthesis Using Ranking Support Vector Machine with Variational Autoencoder INTERSPEECH 2022

From Start to Finish: Latency Reduction Strategies for Incremental Speech Synthesis in Simultaneous Speech-to-Speech Translation INTERSPEECH 2022

Glottal inverse filtering based on articulatory synthesis and deep learning INTERSPEECH 2022

Towards Efficiently Learning Monotonic Alignments for Attention-based End-to-End Speech Recognition INTERSPEECH 2022

Fine-grained Noise Control for Multispeaker Speech Synthesis INTERSPEECH 2022

Unsupervised Text-to-Speech Synthesis by Unsupervised Automatic Speech Recognition INTERSPEECH 2022

Data Augmentation Using McAdams-Coefficient-Based Speaker Anonymization for Fake Audio Detection INTERSPEECH 2022

Normalization of code-switched text for speech synthesis INTERSPEECH 2022

Relationship between the acoustic time intervals and tongue movements of German diphthongs INTERSPEECH 2022

SpecGrad: Diffusion Probabilistic Model based Neural Vocoder with Adaptive Noise Spectral Shaping INTERSPEECH 2022

SATTS: Speaker Attractor Text to Speech, Learning to Speak by Learning to Separate INTERSPEECH 2022

SiD-WaveFlow: A Low-Resource Vocoder Independent of Prior Knowledge INTERSPEECH 2022

Production characteristics of obstruents in WaveNet and older TTS systems INTERSPEECH 2022

FluentTTS: Text-dependent Fine-grained Style Control for Multi-style TTS INTERSPEECH 2022

Towards Multi-Scale Speaking Style Modelling with Hierarchical Context Information for Mandarin Speech Synthesis INTERSPEECH 2022

Combining conversational speech with read speech to improve prosody in Text-to-Speech synthesis INTERSPEECH 2022

DelightfulTTS 2: End-to-End Speech Synthesis with Adversarial Vector-Quantized Auto-Encoders INTERSPEECH 2022

Gi2Pi Rule-based, index-preserving grapheme-to-phoneme transformations ACL 2022

Automatic Song Translation for Tonal Languages ACL 2022

Development of the Siberian Ingrian Finnish Speech Corpus ACL 2022

Text-Free Prosody-Aware Generative Spoken Language Modeling ACL 2022

A Multi-Stage Multi-Codebook VQ-VAE Approach to High-Performance Neural TTS INTERSPEECH 2022

Diffusion Generative Vocoder for Fullband Speech Synthesis Based on Weak Third-order SDE Solver INTERSPEECH 2022