voice conversion

259 papers

Also known as

SVC, VC, EVC

Co-occurring keywords

speech synthesis (753), zero-shot learning (3637), variational autoencoder (1282), speaker identity (74), generative adversarial network (1939), self-supervised learning (3751), speaker verification (577), speaker similarity (35), automatic speech recognition (1764), speaker embedding (350)

Papers

Zero-Shot Text-to-Speech for Text-Based Insertion in Audio Narration INTERSPEECH 2021

RW-Resnet: A Novel Speech Anti-Spoofing Model Using Raw Waveform INTERSPEECH 2021

VoiceMixer: Adversarial Voice Style Mixup NeurIPS 2021

Cross-Lingual Voice Conversion with Disentangled Universal Linguistic Representations INTERSPEECH 2021

Enriching Source Style Transfer in Recognition-Synthesis Based Non-Parallel Voice Conversion INTERSPEECH 2021

Adversarially Learning Disentangled Speech Representations for Robust Multi-Factor Voice Conversion INTERSPEECH 2021

Learning Explicit Prosody Models and Deep Speaker Embeddings for Atypical Voice Conversion INTERSPEECH 2021

Neural Analysis and Synthesis: Reconstructing Speech from Self-Supervised Representations NeurIPS 2021

Global Prosody Style Transfer Without Text Transcriptions ICML 2021

Adversarial Voice Conversion Against Neural Spoofing Detectors INTERSPEECH 2021

Conformer Parrotron: A Faster and Stronger End-to-End Speech Conversion and Recognition Model for Atypical Speech INTERSPEECH 2021

Learning Paralinguistic Features from Audiobooks through Style Voice Conversion NAACL 2021

StarGANv2-VC: A Diverse, Unsupervised, Non-Parallel Framework for Natural-Sounding Voice Conversion INTERSPEECH 2021

TVQVC: Transformer Based Vector Quantized Variational Autoencoder with CTC Loss for Voice Conversion INTERSPEECH 2021

Many-to-Many Voice Conversion Based Feature Disentanglement Using Variational Autoencoder INTERSPEECH 2021

VQMIVC: Vector Quantization and Mutual Information-Based Unsupervised Speech Representation Disentanglement for One-Shot Voice Conversion INTERSPEECH 2021

An Exemplar Selection Algorithm for Native-Nonnative Voice Conversion INTERSPEECH 2021

StarGAN-VC+ASR: StarGAN-Based Non-Parallel Voice Conversion Regularized by Automatic Speech Recognition INTERSPEECH 2021

Two-Pathway Style Embedding for Arbitrary Voice Conversion INTERSPEECH 2021

Non-Parallel Any-to-Many Voice Conversion by Replacing Speaker Statistics INTERSPEECH 2021

CVC: Contrastive Learning for Non-Parallel Voice Conversion INTERSPEECH 2021

Fine-Tuning Pre-Trained Voice Conversion Model for Adding New Target Speakers with Limited Data INTERSPEECH 2021

Improving Robustness of One-Shot Voice Conversion with Deep Discriminative Speaker Encoder INTERSPEECH 2021

One-Shot Voice Conversion with Speaker-Agnostic StarGAN INTERSPEECH 2021

Towards Natural Bilingual and Code-Switched Speech Synthesis Based on Mix of Monolingual Recordings and Cross-Lingual Voice Conversion INTERSPEECH 2020