Papers
Anonymising Elderly and Pathological Speech: Voice Conversion Using DDSP and Query-by-Example
INTERSPEECH 2024
Noise-Robust Voice Conversion by Conditional Denoising Training Using Latent Variables of Recording Quality and Environment
INTERSPEECH 2024
Improving child speech recognition with augmented child-like speech
INTERSPEECH 2024
Utilizing Adaptive Global Response Normalization and Cluster-Based Pseudo Labels for Zero-Shot Voice Conversion
INTERSPEECH 2024
X-E-Speech: Joint Training Framework of Non-Autoregressive Cross-lingual Emotional Text-to-Speech and Voice Conversion
INTERSPEECH 2024
Knowledge Distillation from Self-Supervised Representation Learning Model with Discrete Speech Units for Any-to-Any Streaming Voice Conversion
INTERSPEECH 2024
Unsupervised Domain Adaptation for Speech Emotion Recognition using K-Nearest Neighbors Voice Conversion
INTERSPEECH 2024
Enhancing Polyglot Voices by Leveraging Cross-Lingual Fine-Tuning in Any-to-One Voice Conversion
EMNLP 2024
Towards Naturalistic Voice Conversion: NaturalVoices Dataset with an Automatic Processing Pipeline
INTERSPEECH 2024
Improvement Speaker Similarity for Zero-Shot Any-to-Any Voice Conversion of Whispered and Regular Speech
INTERSPEECH 2024
DreamVoice: Text-Guided Voice Conversion
INTERSPEECH 2024
CtrSVDD: A Benchmark Dataset and Baseline Analysis for Controlled Singing Voice Deepfake Detection
INTERSPEECH 2024
SRC4VC: Smartphone-Recorded Corpus for Voice Conversion Benchmark
INTERSPEECH 2024
Improving Copy-Synthesis Anti-Spoofing Training Method with Rhythm and Speaker Perturbation
INTERSPEECH 2024
VoxFlow AI: wearable voice converter for atypical speech
INTERSPEECH 2024
Spoof Diarization: "What Spoofed When" in Partially Spoofed Audio
INTERSPEECH 2024