Papers
Zero-Shot Text-to-Speech for Text-Based Insertion in Audio Narration
INTERSPEECH 2021
RW-Resnet: A Novel Speech Anti-Spoofing Model Using Raw Waveform
INTERSPEECH 2021
Cross-Lingual Voice Conversion with Disentangled Universal Linguistic Representations
INTERSPEECH 2021
Enriching Source Style Transfer in Recognition-Synthesis Based Non-Parallel Voice Conversion
INTERSPEECH 2021
Adversarially Learning Disentangled Speech Representations for Robust Multi-Factor Voice Conversion
INTERSPEECH 2021
Learning Explicit Prosody Models and Deep Speaker Embeddings for Atypical Voice Conversion
INTERSPEECH 2021
Adversarial Voice Conversion Against Neural Spoofing Detectors
INTERSPEECH 2021
Conformer Parrotron: A Faster and Stronger End-to-End Speech Conversion and Recognition Model for Atypical Speech
INTERSPEECH 2021
StarGANv2-VC: A Diverse, Unsupervised, Non-Parallel Framework for Natural-Sounding Voice Conversion
INTERSPEECH 2021
TVQVC: Transformer Based Vector Quantized Variational Autoencoder with CTC Loss for Voice Conversion
INTERSPEECH 2021
Many-to-Many Voice Conversion Based Feature Disentanglement Using Variational Autoencoder
INTERSPEECH 2021
VQMIVC: Vector Quantization and Mutual Information-Based Unsupervised Speech Representation Disentanglement for One-Shot Voice Conversion
INTERSPEECH 2021
StarGAN-VC+ASR: StarGAN-Based Non-Parallel Voice Conversion Regularized by Automatic Speech Recognition
INTERSPEECH 2021
Two-Pathway Style Embedding for Arbitrary Voice Conversion
INTERSPEECH 2021
CVC: Contrastive Learning for Non-Parallel Voice Conversion
INTERSPEECH 2021
Fine-Tuning Pre-Trained Voice Conversion Model for Adding New Target Speakers with Limited Data
INTERSPEECH 2021
Improving Robustness of One-Shot Voice Conversion with Deep Discriminative Speaker Encoder
INTERSPEECH 2021
One-Shot Voice Conversion with Speaker-Agnostic StarGAN
INTERSPEECH 2021