Papers
Rapid Enhancement of NLP Systems by Acquisition of Data in Correlated Domains
Tejas Udayakumar, Kinnera Saranu, Mayuresh Sanjay Oak et al.
Rapid RNN-T Adaptation Using Personalized Speech Synthesis and Neural Language Generator
Yan Huang, Jinyu Li, Lei He et al.
Raw Sign and Magnitude Spectra for Multi-Head Acoustic Modelling
Erfan Loweimi, Peter Bell, Steve Renals
Raw Speech Waveform Based Classification of Patients with ALS, Parkinson’s Disease and Healthy Controls Using CNN-BLSTM
Jhansi Mallela, Aravind Illa, Yamini Belur et al.
Real-Time, Full-Band, Online DNN-Based Voice Conversion System Using a Single CPU
Takaaki Saeki, Yuki Saito, Shinnosuke Takamichi et al.
Real-Time Single-Channel Deep Neural Network-Based Speech Enhancement on Edge Devices
Nikhil Shankar, Gautam Shreedhar Bhat, Issa M.S. Panahi
Real Time Speech Enhancement in the Waveform Domain
Alexandre Défossez, Gabriel Synnaeve, Yossi Adi
Recognising Emotions in Dysarthric Speech Using Typical Speech Data
Lubna Alhinti, Stuart Cunningham, Heidi Christensen
Recognition-Synthesis Based Non-Parallel Voice Conversion with Adversarial Learning
Jing-Xuan Zhang, Zhen-Hua Ling, Li-Rong Dai
Reformer-TTS: Neural Speech Synthesis with Reformer Network
Hyeong Rae Ihm, Joun Yeop Lee, Byoung Jin Choi et al.
Regional Resonance of the Lower Vocal Tract and its Contribution to Speaker Characteristics
Lin Zhang, Kiyoshi Honda, Jianguo Wei et al.
Relational Teacher Student Learning with Neural Label Embedding for Device Adaptation in Acoustic Scene Classification
Hu Hu, Sabato Marco Siniscalchi, Yannan Wang et al.
Relative Positional Encoding for Speech Recognition and Direct Translation
Ngoc-Quan Pham, Thanh-Le Ha, Tuan-Nam Nguyen et al.
Releasing a Toolkit and Comparing the Performance of Language Embeddings Across Various Spoken Language Identification Datasets
Matias Lindgren, Tommi Jauhiainen, Mikko Kurimo
Removing Bias with Residual Mixture of Multi-View Attention for Speech Emotion Recognition
Md. Asif Jalal, Rosanna Milner, Thomas Hain et al.
Representation Based Meta-Learning for Few-Shot Spoken Intent Recognition
Ashish Mittal, Samarth Bharadwaj, Shreya Khare et al.
Rescore in a Flash: Compact, Cache Efficient Hashing Data Structures for n-Gram Language Models
Grant P. Strimel, Ariya Rastrow, Gautam Tiwari et al.
Resource-Adaptive Deep Learning for Visual Speech Recognition
Alexandros Koumparoulis, Gerasimos Potamianos, Samuel Thomas et al.
Reverberation Modeling for Source-Filter-Based Neural Vocoder
Yang Ai, Xin Wang, Junichi Yamagishi et al.
Re-Weighted Interval Loss for Handling Data Imbalance Problem of End-to-End Keyword Spotting
Kun Zhang, Zhiyong Wu, Daode Yuan et al.
Rhythmic Convergence in Canadian French Varieties?
Svetlana Kaminskaïa
Risk Forecasting from Earnings Calls Acoustics and Network Correlations
Ramit Sawhney, Arshiya Aggarwal, Piyush Khanna et al.