Papers
Robust DOA Estimation Based on Convolutional Neural Network and Time-Frequency Masking
Wangyou Zhang, Ying Zhou, Yanmin Qian
Robust Keyword Spotting via Recycle-Pooling for Mobile Game
Shounan An, Youngsoo Kim, Hu Xu et al.
Robustness of Statistical Voice Conversion Based on Direct Waveform Modification Against Background Sounds
Yusuke Kurita, Kazuhiro Kobayashi, Kazuya Takeda et al.
Robust Sequence-to-Sequence Acoustic Modeling with Stepwise Monotonic Attention for Neural TTS
Mutian He, Yan Deng, Lei He
Robust Sound Recognition: A Neuromorphic Approach
Jibin Wu, Zihan Pan, Malu Zhang et al.
Robust Speech Emotion Recognition Under Different Encoding Conditions
Christopher Oates, Andreas Triantafyllopoulos, Ingmar Steiner et al.
R-Vectors: New Technique for Adaptation to Room Acoustics
Yuri Khokhlov, Alexander Zatvornitskiy, Ivan Medennikov et al.
RWTH ASR Systems for LibriSpeech: Hybrid vs Attention
Christoph Lüscher, Eugen Beck, Kazuki Irie et al.
Salient Speech Representations Based on Cloned Networks
W. Bastiaan Kleijn, Felicia S.C. Lim, Michael Chinen et al.
Sampling from Stochastic Finite Automata with Applications to CTC Decoding
Martin Jansche, Alexander Gutkin
SANTLR: Speech Annotation Toolkit for Low Resource Languages
Xinjian Li, Zhong Zhou, Siddharth Dalmia et al.
Say What? A Dataset for Exploring the Error Patterns That Two ASR Engines Make
Meredith Moore, Michael Saxon, Hemanth Venkateswara et al.
Scalable Multi Corpora Neural Language Models for ASR
Anirudh Raju, Denis Filimonov, Gautam Tiwari et al.
Selection and Training Schemes for Improving TTS Voice Built on Found Data
F.-Y. Kuo, I.C. Ouyang, S. Aryal et al.
Self-Attention for Speech Emotion Recognition
Lorenzo Tarantino, Philip N. Garner, Alexandros Lazaridis
Self Attention in Variational Sequential Learning for Summarization
Jen-Tzung Chien, Chun-Wei Wang
Self-Attention Transducers for End-to-End Speech Recognition
Zhengkun Tian, Jiangyan Yi, Jianhua Tao et al.
Self-Imitating Feedback Generation Using GAN for Computer-Assisted Pronunciation Training
Seung Hee Yang, Minhwa Chung
Self Multi-Head Attention for Speaker Recognition
Miquel India, Pooyan Safari, Javier Hernando
Self-Supervised Speaker Embeddings
Themos Stafylakis, Johan Rohdin, Oldřich Plchot et al.
Self-Teaching Networks
Liang Lu, Eric Sun, Yifan Gong
Semi-Supervised Acoustic Model Training for Five-Lingual Code-Switched ASR
Astik Biswas, Emre Yılmaz, Febe de Wet et al.
Semi-Supervised Audio Classification with Consistency-Based Regularization
Kangkang Lu, Chuan-Sheng Foo, Kah Kuan Teh et al.
Semi-Supervised Prosody Modeling Using Deep Gaussian Process Latent Variable Model
Tomoki Koriyama, Takao Kobayashi
Semi-Supervised Sequence-to-Sequence ASR Using Unpaired Speech and Text
Murali Karthick Baskar, Shinji Watanabe, Ramon Astudillo et al.