Papers
Pruned RNN-T for fast, memory-efficient ASR training
Fangjun Kuang, Liyong Guo, Wei Kang et al.
Pseudo Label Is Better Than Human Label
Dongseong Hwang, Khe Chai Sim, Zhouyuan Huo et al.
Pushing the limits of raw waveform speaker recognition
Jee-weon Jung, Youjin Kim, Hee-Soo Heo et al.
QbyE-MLPMixer: Query-by-Example Open-Vocabulary Keyword Spotting using MLPMixer
Jinmiao Huang, Waseem Gharbieh, Qianhui Wan et al.
QDPN - Quasi-dual-path Network for single-channel Speech Separation
Joel Rixen, Matthias Renz
Qualitative Evaluation of Language Model Rescoring in Automatic Speech Recognition
Thibault Bañeras Roux, Mickael Rouvier, Jane Wottawa et al.
Radio2Speech: High Quality Speech Recovery from Radio Frequency Signals
Running Zhao, Jiangtao Yu, Tingle Li et al.
RaDur: A Reference-aware and Duration-robust Network for Target Sound Detection
Dongchao Yang, Helin Wang, Zhongjie Ye et al.
Rainbow Keywords: Efficient Incremental Learning for Online Spoken Keyword Spotting
Yang Xiao, Nana Hou, Eng Siong Chng
RCT: Random consistency training for semi-supervised sound event detection
Nian Shao, Erfan Loweimi, Xiaofei Li
Real-Time Monitoring of Silences in Contact Center Conversations
Digvijay Ingle, Ayush Kumar, Krishnachaitanya Gogineni et al.
Real-Time Packet Loss Concealment With Mixed Generative and Predictive Model
Jean-Marc Valin, Ahmed Mustafa, Christopher Montgomery et al.
Recent improvements of ASR models in the face of adversarial attacks
Raphael Olivier, Bhiksha Raj
Recording and timing vocal responses in online experimentation
Katrina Kechun Li, Julia Schwarz, Jasper Hong Sim et al.
Recurrent multi-head attention fusion network for combining audio and text for speech emotion recognition
Chung-Soo Ahn, Chamara Kasun, Sunil Sivadas et al.
Reducing Domain mismatch in Self-supervised speech pre-training
Murali Karthick Baskar, Andrew Rosenberg, Bhuvana Ramabhadran et al.
Reducing Geographic Disparities in Automatic Speech Recognition via Elastic Weight Consolidation
Viet Anh Trinh, Pegah Ghahremani, Brian King et al.
reducing multilingual context confusion for end-to-end code-switching automatic speech recognition
Shuai Zhang, Jiangyan Yi, Zhengkun Tian et al.
Reducing Offensive Replies in Open Domain Dialogue Systems
Naokazu Uchida, Takeshi Homma, Makoto Iwayama et al.
RefineGAN: Universally Generating Waveform Better than Ground Truth with Highly Accurate Pitch and Intensity Responses
Shengyuan Xu, Wenxiao Zhao, Jing Guo
Refining DNN-based Mask Estimation using CGMM-based EM Algorithm for Multi-channel Noise Reduction
Julitta Bartolewska, Stanisław Kacprzak, Konrad Kowalczyk
RefTextLAS: Reference Text Biased Listen, Attend, and Spell Model For Accurate Reading Evaluation
Phani Sankar Nidadavolu, Na Xu, Nick Jutila et al.
Regularizing Transformer-based Acoustic Models by Penalizing Attention Weights
Munhak Lee, Joon-Hyuk Chang, Sang-Eon Lee et al.
Relating the fundamental frequency of speech with EEG using a dilated convolutional network
Corentin Puffay, Jana Van Canneyt, Jonas Vanthornhout et al.