Papers
8,761 papers found
CFVC: Conditional Filtering for Controllable Voice Conversion
Kou Tanaka, Takuhiro Kaneko, Hirokazu Kameoka et al.
ChatGPT-EDSS: Empathetic Dialogue Speech Synthesis Trained from ChatGPT-derived Context Word Embeddings
Yuki Saito, Shinnosuke Takamichi, Eiji Iimori et al.
Chinese EFL Learners’ Perception of English Prosodic Focus
Xinya Zhang, Ying Chen
ClArTTS: An Open-Source Classical Arabic Text-to-Speech Corpus
Ajinkya Kulkarni, Atharva Kulkarni, Sara Abedalmon'em Mohammad Shatnawi et al.
Classification of Multi-class Vowels and Fricatives From Patients Having Amyotrophic Lateral Sclerosis with Varied Levels of Dysarthria Severity
Chowdam Venkata Thirumala Kumar, Tanuka Bhattacharjee, Yamini Belur et al.
Classification of Vocal Intensity Category from Speech using the Wav2vec2 and Whisper Embeddings
Manila Kodali, Sudarsana Reddy Kadiri, Paavo Alku
Classifying Dementia in the Presence of Depression: A Cross-Corpus Study
Franziska Braun, Sebastian P. Bayerl, Paula A. Pérez-Toro et al.
Classifying depression symptom severity: Assessment of speech representations in personalized and generalized machine learning models.
Edward L. Campbell, Judith Dineley, Pauline Conde et al.
Classifying Rhoticity of /ɹ/ in Speech Sound Disorder using Age-and-Sex Normalized Formants
Nina R Benway, Jonathan L Preston, Asif Salekin et al.
CleanUNet 2: A Hybrid Speech Denoising Model on Waveform and Spectrogram
Zhifeng Kong, Wei Ping, Ambrish Dantrey et al.
CLRL-Tuning: A Novel Continual Learning Approach for Automatic Speech Recognition
Zhihan Wang, Feng Hou, Ruili Wang
CN-Celeb-AV: A Multi-Genre Audio-Visual Dataset for Person Recognition
Lantian Li, Xiaolou Li, Haoyu Jiang et al.
CNVVE: Dataset and Benchmark for Classifying Non-verbal Voice
Ramin Hedeshy, Raphael Menges, Steffen Staab
Coarticulation of Sibe Vowels and Dorsal Fricatives in Spontaneous Speech: An Acoustic Study
Jared Sharp, Matthew Faytak, Hasutai Fei Xiong Liu
CoBERT: Self-Supervised Speech Representation Learning Through Code Representation Learning
Chutong Meng, Junyi Ao, Tom Ko et al.
Cochlear-implant Listeners Listening to Cochlear-implant Simulated Speech
Fanhui Kong, Nengheng Zheng, Xianren Wang et al.
Coherence Estimation Tracks Auditory Attention in Listeners with Hearing Impairment
Oskar Keding, Emina Alickovic, Martin A. Skoglund et al.
Combining acoustic and aerodynamic data collection: A perceptual evaluation of acoustic distortions
Amélie Elmerich, Jiayin Gao, Angelique Amelot et al.
Combining language corpora in a Japanese electromagnetic articulography database for acoustic-to-articulatory inversion
Tianfang Yan, Kikuo Maekawa, Yukiko Nota et al.
Combining Multilingual Resources and Models to Develop State-of-the-Art E2E ASR for Swedish
Lukas Mateju, Jan Nouza, Petr Červa et al.
Combining Multiple Multimodal Speech Features into an Interpretable Index Score for Capturing Disease Progression in Amyotrophic Lateral Sclerosis
Michael Neumann, Hardik Kothare, Vikram Ramanarayanan
ComedicSpeech: Text To Speech For Stand-up Comedies in Low-Resource Scenarios
Yuyue Wang, Huan Xiao, Yihan Wu et al.
CoMFLP: Correlation Measure Based Fast Search on ASR Layer Pruning
Wei Liu, Zhiyuan Peng, Tan Lee
CommonAccent: Exploring Large Acoustic Pretrained Models for Accent Classification Based on Common Voice
Juan Zuluaga-Gomez, Sara Ahmed, Danielius Visockas et al.
Comparing /b/ and /d/ with a Single Physical Model of the Human Vocal Tract to Visualize Droplets Produced while Speaking
Takayuki Arai, Tsukasa Yoshinaga, Akiyoshi Iida