Co-occurring keywords
Papers
Articulatory synthesis using representations learnt through phonetic label-aware contrastive loss
INTERSPEECH 2024
Make-A-Voice: Revisiting Voice Large Language Models as Scalable Multilingual and Multitask Learners
ACL 2024
Enhancing Out-of-Vocabulary Performance of Indian TTS Systems for Practical Applications through Low-Effort Data Strategies
INTERSPEECH 2024
Highly Intelligible Speaker-Independent Articulatory Synthesis
INTERSPEECH 2024
FakeSound: Deepfake General Audio Detection
INTERSPEECH 2024
QGAN: Low Footprint Quaternion Neural Vocoder for Speech Synthesis
INTERSPEECH 2024
BiVocoder: A Bidirectional Neural Vocoder Integrating Feature Extraction and Waveform Generation
INTERSPEECH 2024
Towards Naturalistic Voice Conversion: NaturalVoices Dataset with an Automatic Processing Pipeline
INTERSPEECH 2024
Towards EMG-to-Speech with Necklace Form Factor
INTERSPEECH 2024
GTSinger: A Global Multi-Technique Singing Corpus with Realistic Music Scores for All Singing Tasks
NIPS 2024
1000 African Voices: Advancing inclusive multi-speaker multi-accent speech synthesis
INTERSPEECH 2024
JenGAN: Stacked Shifted Filters in GAN-Based Speech Synthesis
INTERSPEECH 2024
Stress transfer in speech-to-speech machine translation
INTERSPEECH 2024