speech synthesis

753 papers

Also known as

SSS, SS, TTS

Co-occurring keywords

neural vocoder (126), voice conversion (259), text-to-speech synthesis (293), speech recognition (1223), deep neural network (1801), speech generation (97), low-resource language (2234), automatic speech recognition (1764), generative adversarial network (1939), neural network (6616)

Papers

MaskedSpeech: Context-aware Speech Synthesis with Masking Strategy INTERSPEECH 2023

Japanese-to-English Simultaneous Dubbing Prototype ACL 2023

CLAPSpeech: Learning Prosody from Text Context with Contrastive Language-Audio Pre-Training ACL 2023

Non-parallel Accent Transfer based on Fine-grained Controllable Accent Modelling EMNLP 2023

FastDiff 2: Revisiting and Incorporating GANs and Diffusion Models in High-Fidelity Speech Synthesis ACL 2023

Iteratively Improving Speech Recognition and Voice Conversion INTERSPEECH 2023

HABLA: A Dataset of Latin American Spanish Accents for Voice Anti-spoofing INTERSPEECH 2023

The Role of Formant and Excitation Source Features in Perceived Naturalness of Low Resource Tribal Language TTS: An Empirical Study INTERSPEECH 2023

FluentSpeech: Stutter-Oriented Automatic Speech Editing with Context-Aware Diffusion Models ACL 2023

Decoupling Segmental and Prosodic Cues of Non-native Speech through Vector Quantization INTERSPEECH 2023

DualVC: Dual-mode Voice Conversion using Intra-model Knowledge Distillation and Hybrid Predictive Coding INTERSPEECH 2023

Simple and Effective Unsupervised Speech Translation ACL 2023

UnitY: Two-pass Direct Speech-to-speech Translation with Discrete Units ACL 2023

Streaming Parrotron for on-device speech-to-speech conversion INTERSPEECH 2023

Evaluation of delexicalization methods for research on emotional speech INTERSPEECH 2023

RWEN-TTS: Relation-Aware Word Encoding Network for Natural Text-to-Speech Synthesis AAAI 2023

EdenTTS: A Simple and Efficient Parallel Text-to-speech Architecture with Collaborative Duration-alignment Learning INTERSPEECH 2023

ChatGPT-EDSS: Empathetic Dialogue Speech Synthesis Trained from ChatGPT-derived Context Word Embeddings INTERSPEECH 2023

StyleTalk: One-Shot Talking Head Generation with Controllable Speaking Styles AAAI 2023

Speech inpainting: Context-based speech synthesis guided by video INTERSPEECH 2023

Self-Supervised Solution to the Control Problem of Articulatory Synthesis INTERSPEECH 2023

Avocodo: Generative Adversarial Network for Artifact-Free Vocoder AAAI 2023

ClArTTS: An Open-Source Classical Arabic Text-to-Speech Corpus INTERSPEECH 2023

P-Flow: A Fast and Data-Efficient Zero-Shot TTS through Speech Prompting NeurIPS 2023

Reverberation-Controllable Voice Conversion Using Reverberation Time Estimator INTERSPEECH 2023