Papers
Chunking Defense for Adversarial Attacks on ASR
Yiwen Shao, Jesus Villalba, Sonal Joshi et al.
Class-Aware Distribution Alignment based Unsupervised Domain Adaptation for Speaker Verification
Hang-Rui Hu, Yan Song, Li-Rong Dai et al.
Classification of Accented English Using CNN Model Trained on Amplitude Mel-Spectrograms
Mariia Lesnichaia, Veranika Mikhailava, Natalia Bogach et al.
Clock Skew Robust Acoustic Echo Cancellation
Karim Helwani, Erfan Soltanmohammadi, Michael Mark Goodwin et al.
Clustering-based Wake Word Detection in Privacy-aware Acoustic Sensor Networks
Timm Koppelmann, Luca Becker, Alexandru Nelus et al.
CMGAN: Conformer-based Metric GAN for Speech Enhancement
Ruizhe Cao, Sherif Abdulatif, Bin Yang
CNN-based Audio Event Recognition for Automated Violence Classification and Rating for Prime Video Content
Mayank Sharma, Tarun Gupta, Kenny Qiu et al.
CoachLea: an Android Application to Evaluate the Speech Production and Perception of Children with Hearing Loss
P. Schäfer, P. A. Pérez-Toro, P. Klumpp et al.
Coarse-Grained Attention Fusion With Joint Training Framework for Complex Speech Enhancement and End-to-End Speech Recognition
Xuyi Zhuang, Lu Zhang, Zehua Zhang et al.
CoCA-MDD: A Coupled Cross-Attention based Framework for Streaming Mispronunciation Detection and Diagnosis
Nianzu Zheng, Liqun Deng, Wenyong Huang et al.
Combining conversational speech with read speech to improve prosody in Text-to-Speech synthesis
Johannah O'Mahony, Catherine Lai, Simon King
Combining Simple but Novel Data Augmentation Methods for Improving Conformer ASR
Ronit Damania, Christopher Homan, Emily Prud'hommeaux
Combining Spectral and Self-Supervised Features for Low Resource Speech Recognition and Translation
Dan Berrebbi, Jiatong Shi, Brian Yan et al.
Comparing 1-dimensional and 2-dimensional spectral feature representations in voice pathology detection using machine learning and deep learning classifiers
Farhad Javanmardi, Sudarsana Reddy Kadiri, Manila Kodali et al.
Comparison and Analysis of New Curriculum Criteria for End-to-End ASR
Georgios Karakasidis, Tamás Grósz, Mikko Kurimo
Comparison of 5 methods for the evaluation of intelligibility in mild to moderate French dysarthric speech
Cécile Fougeron, Nicolas Audibert, Ina Kodrasi et al.
Comparison of Models for Detecting Off-Putting Speaking Styles
Diego Aguirre, Nigel Ward, Jonathan E. Avila et al.
Comparison of Unsupervised Learning and Supervised Learning with Noisy Labels for Low-Resource Speech Recognition
Yanick Schraner, Christian Scheller, Michel Plüss et al.
Compensation in Verbal and Nonverbal Communication after Total Laryngectomy
Marise Neijman, Femke Hof, Noelle Oosterom et al.
Complex Frequency Domain Linear Prediction: A Tool to Compute Modulation Spectrum of Speech
Samik Sadhu, Hynek Hermansky
Complex Paralinguistic Analysis of Speech: Predicting Gender, Emotions and Deception in a Hierarchical Framework
Alena Velichko, Maxim Markitantov, Heysem Kaya et al.
Complex sounds and cross-language influence: The case of ejectives in Omani Mehri
Rachid Ridouane, Philipp Buech
Complex-Valued Time-Frequency Self-Attention for Speech Dereverberation
Vinay Kothapally, John H.L. Hansen
Compute Cost Amortized Transformer for Streaming ASR
Yi Xie, Jonathan J. Macoskey, Martin Radfar et al.