voice conversion

259 papers

Explore in graph

Also known as

SVC VC EVC

Co-occurring keywords

speech synthesis (753) zero-shot learning (3637) variational autoencoder (1282) speaker identity (74) generative adversarial network (1939) self-supervised learning (3751) speaker verification (577) speaker similarity (35) automatic speech recognition (1764) speaker embedding (350)

Papers

HybridVC: Efficient Voice Style Conversion with Text and Audio Prompts INTERSPEECH 2024

Anonymising Elderly and Pathological Speech: Voice Conversion Using DDSP and Query-by-Example INTERSPEECH 2024

Noise-Robust Voice Conversion by Conditional Denoising Training Using Latent Variables of Recording Quality and Environment INTERSPEECH 2024

Improving child speech recognition with augmented child-like speech INTERSPEECH 2024

Utilizing Adaptive Global Response Normalization and Cluster-Based Pseudo Labels for Zero-Shot Voice Conversion INTERSPEECH 2024

RW-VoiceShield: Raw Waveform-based Adversarial Attack on One-shot Voice Conversion INTERSPEECH 2024

X-E-Speech: Joint Training Framework of Non-Autoregressive Cross-lingual Emotional Text-to-Speech and Voice Conversion INTERSPEECH 2024

Knowledge Distillation from Self-Supervised Representation Learning Model with Discrete Speech Units for Any-to-Any Streaming Voice Conversion INTERSPEECH 2024

Phoneme Hallucinator: One-Shot Voice Conversion via Set Expansion AAAI 2024

PRVAE-VC2: Non-Parallel Voice Conversion by Distillation of Speech Representations INTERSPEECH 2024

Unsupervised Domain Adaptation for Speech Emotion Recognition using K-Nearest Neighbors Voice Conversion INTERSPEECH 2024

Enhancing Polyglot Voices by Leveraging Cross-Lingual Fine-Tuning in Any-to-One Voice Conversion EMNLP 2024

Neural Codec Language Models for Disentangled and Textless Voice Conversion INTERSPEECH 2024

DiffVC+: Improving Diffusion-based Voice Conversion for Speaker Anonymization INTERSPEECH 2024

Towards Naturalistic Voice Conversion: NaturalVoices Dataset with an Automatic Processing Pipeline INTERSPEECH 2024

Improvement Speaker Similarity for Zero-Shot Any-to-Any Voice Conversion of Whispered and Regular Speech INTERSPEECH 2024

DreamVoice: Text-Guided Voice Conversion INTERSPEECH 2024

CtrSVDD: A Benchmark Dataset and Baseline Analysis for Controlled Singing Voice Deepfake Detection INTERSPEECH 2024

SRC4VC: Smartphone-Recorded Corpus for Voice Conversion Benchmark INTERSPEECH 2024

StyleSinger: Style Transfer for Out-of-Domain Singing Voice Synthesis AAAI 2024

Philippine Languages Database: A Multilingual Speech Corpora for Developing Systems for Low-Resource Languages COLING 2024

Improving Copy-Synthesis Anti-Spoofing Training Method with Rhythm and Speaker Perturbation INTERSPEECH 2024

VoxFlow AI: wearable voice converter for atypical speech INTERSPEECH 2024

Spoof Diarization: "What Spoofed When" in Partially Spoofed Audio INTERSPEECH 2024

Reverberation-Controllable Voice Conversion Using Reverberation Time Estimator INTERSPEECH 2023