Papers
Responsiveness, Sensitivity and Clinical Utility of Timing-Related Speech Biomarkers for Remote Monitoring of ALS Disease Progression
Hardik Kothare, Michael Neumann, Jackson Liscombe et al.
Rethinking Complex-Valued Deep Neural Networks for Monaural Speech Enhancement
Haibin Wu, Ke Tan, Buye Xu et al.
Rethinking Speech Recognition with A Multimodal Perspective via Acoustic and Semantic Cooperative Decoding
Tian-Hao Zhang, Hai-Bo Qin, Zhi-Hao Lai et al.
Rethinking the Visual Cues in Audio-Visual Speaker Extraction
Junjie Li, Meng Ge, Zexu Pan et al.
Rethinking Transfer and Auxiliary Learning for Improving Audio Captioning Transformer
Wooseok Shin, Hyun Joon Park, Jin Sob Kim et al.
Retraining-free Customized ASR for Enharmonic Words Based on a Named-Entity-Aware Model and Phoneme Similarity Estimation
Yui Sudo, Kazuya Hata, Kazuhiro Nakadai
Reverberation-Controllable Voice Conversion Using Reverberation Time Estimator
Yeonjong Choi, Chao Xie, Tomoki Toda
Reversible Neural Networks for Memory-Efficient Speaker Verification
Bei Liu, Yanmin Qian
Rhythmic Characteristics of L2 German Speech by Advanced Chinese Learners
Lindun Ge, Min Xu, Hongwei Ding
RMVPE: A Robust Model for Vocal Pitch Estimation in Polyphonic Music
Haojie Wei, Xueke Cao, Tangpeng Dan et al.
Robust Audio Anti-spoofing Countermeasure with Joint Training of Front-end and Back-end Models
Xingming Wang, Bang Zeng, Suo Hongbin et al.
Robust Audio Anti-Spoofing with Fusion-Reconstruction Learning on Multi-Order Spectrograms
Penghui Wen, Kun Hu, Wenxi Yue et al.
Robust Automatic Speech Recognition via WavAugment Guided Phoneme Adversarial Training
Gege Qi, Yuefeng Chen, Xiaofeng Mao et al.
Robust Feature Decoupling in Voice Conversion by Using Locality-Based Instance Normalization
Yewei Gu, Xianfeng Zhao, Xiaowei Yi
Robust Keyword Spotting for Noisy Environments by Leveraging Speech Enhancement and Speech Presence Probability
Chouchang Yang, Yashas Malur Saidutta, Rakshith Sharma Srinivasa et al.
Robust Prototype Learning for Anomalous Sound Detection
Xiao-Min Zeng, Yan Song, Ian McLoughlin et al.
Robust Self Supervised Speech Embeddings for Child-Adult Classification in Interactions involving Children with Autism
Rimita Lahiri, Tiantian Feng, Rajat Hebbar et al.
Robust Training for Speaker Verification against Noisy Labels
Zhihua Fang, Liang He, Hanhan Ma et al.
S2CD: Self-heuristic Speaker Content Disentanglement for Any-to-Any Voice Conversion
Pengfei Wei, Xiang Yin, Chunfeng Wang et al.
SALTTS: Leveraging Self-Supervised Speech Representations for improved Text-to-Speech Synthesis
Ramanan Sivaguru, Vasista Sai Lodagala, S Umesh
Sampling bias in NLU models: Impact and Mitigation
Zefei Li, Anil Ramakrishna, Anna Rumshisky et al.
SASPEECH: A Hebrew Single Speaker Dataset for Text To Speech and Voice Conversion
Orian Sharoni, Roee Shenberg, Erica Cooper
Scaling Laws for Discriminative Speech Recognition Rescoring Models
Yile Gu, Prashanth Gurunath Shivakumar, Jari Kolehmainen et al.
Score-balanced Loss for Multi-aspect Pronunciation Assessment
Heejin Do, Yunsu Kim, Gary Geunbae Lee