Papers
Remote Smartphone-Based Speech Collection: Acceptance and Barriers in Individuals with Major Depressive Disorder
Judith Dineley, Grace Lavelle, Daniel Leightley et al.
Representation Learning to Classify and Detect Adversarial Attacks Against Speaker and Speech Recognition Systems
Jesús Villalba, Sonal Joshi, Piotr Żelasko et al.
Residual Echo and Noise Cancellation with Feature Attention Module and Multi-Domain Loss Function
Jianjun Gu, Longbiao Cheng, Xingwei Sun et al.
Residual Energy-Based Models for End-to-End Speech Recognition
Qiujia Li, Yu Zhang, Bo Li et al.
Restoring Degraded Speech via a Modified Diffusion Model
Jianwei Zhang, Suren Jayasuriya, Visar Berisha
Rethinking End-to-End Evaluation of Decomposable Tasks: A Case Study on Spoken Language Understanding
Siddhant Arora, Alissa Ostapenko, Vijay Viswanathan et al.
Rethinking Evaluation in ASR: Are Our Models Robust Enough?
Tatiana Likhomanenko, Qiantong Xu, Vineel Pratap et al.
Revisiting Parity of Human vs. Machine Conversational Speech Transcription
Courtney Mansfield, Sara Ng, Gina-Anne Levow et al.
Revisiting Recall Effects of Filler Particles in German and English
Beeke Muhlack, Mikey Elmers, Heiner Drenhaus et al.
Rich Prosody Diversity Modelling with Phone-Level Mixture Density Network
Chenpeng Du, Kai Yu
Robust Command Recognition for Lithuanian Air Traffic Control Tower Utterances
Oliver Ohneiser, Seyyed Saeed Sarfjoo, Hartmut Helmke et al.
Robust Continuous On-Device Personalization for Automatic Speech Recognition
Khe Chai Sim, Angad Chandorkar, Fan Gao et al.
Robust End-to-End Speaker Diarization with Conformer and Additive Margin Penalty
Tsun-Yat Leung, Lahiru Samarakoon
Robust Laughter Detection in Noisy Environments
Jon Gillick, Wesley Deng, Kimiko Ryokai et al.
Robust Speaker Extraction Network Based on Iterative Refined Adaptation
Chengyun Deng, Shiqian Ma, Yongtao Sha et al.
Robust wav2vec 2.0: Analyzing Domain Shift in Self-Supervised Pre-Training
Wei-Ning Hsu, Anuroop Sriram, Alexei Baevski et al.
ROXANNE Research Platform: Automate Criminal Investigations
Maël Fabien, Shantipriya Parida, Petr Motlicek et al.
RW-Resnet: A Novel Speech Anti-Spoofing Model Using Raw Waveform
Youxuan Ma, Zongze Ren, Shugong Xu
RyanSpeech: A Corpus for Conversational Text-to-Speech Synthesis
Rohola Zandie, Mohammad H. Mahoor, Julia Madsen et al.
S2VC: A Framework for Any-to-Any Voice Conversion with Self-Supervised Pretrained Representations
Jheng-hao Lin, Yist Y. Lin, Chung-Ming Chien et al.
Save Your Voice: Voice Banking and TTS for Anyone
Daniel Tihelka, Markéta Řezáčková, Martin Grůber et al.
Scaling Effect of Self-Supervised Speech Models
Jie Pu, Yuguang Yang, Ruirui Li et al.
Scaling Laws for Acoustic Models
Jasha Droppo, Oguz Elibol
Scaling Sparsemax Based Channel Selection for Speech Recognition with ad-hoc Microphone Arrays
Junqi Chen, Xiao-Lei Zhang
Scenario-Dependent Speaker Diarization for DIHARD-III Challenge
Yu-Xuan Wang, Jun Du, Maokui He et al.