Papers
Rescoring Keyword Search Confidence Estimates with Graph-Based Re-Ranking Using Acoustic Word Embeddings
Anna Piunova, Eugen Beck, Ralf Schlüter et al.
Residual + Capsule Networks (ResCap) for Simultaneous Single-Channel Overlapped Keyword Recognition
Yan Xiong, Visar Berisha, Chaitali Chakrabarti
Reverse Transfer Learning: Can Word Embeddings Trained for Different NLP Tasks Improve Neural Language Models?
Lyan Verwimp, Jerome R. Bellegarda
Robust Bayesian and Light Neural Networks for Voice Spoofing Detection
Radosław Białobrzeski, Michał Kośmider, Mateusz Matuszewski et al.
Robust DOA Estimation Based on Convolutional Neural Network and Time-Frequency Masking
Wangyou Zhang, Ying Zhou, Yanmin Qian
Robust Keyword Spotting via Recycle-Pooling for Mobile Game
Shounan An, Youngsoo Kim, Hu Xu et al.
Robustness of Statistical Voice Conversion Based on Direct Waveform Modification Against Background Sounds
Yusuke Kurita, Kazuhiro Kobayashi, Kazuya Takeda et al.
Robust Sequence-to-Sequence Acoustic Modeling with Stepwise Monotonic Attention for Neural TTS
Mutian He, Yan Deng, Lei He
Robust Sound Recognition: A Neuromorphic Approach
Jibin Wu, Zihan Pan, Malu Zhang et al.
Robust Speech Emotion Recognition Under Different Encoding Conditions
Christopher Oates, Andreas Triantafyllopoulos, Ingmar Steiner et al.
R-Vectors: New Technique for Adaptation to Room Acoustics
Yuri Khokhlov, Alexander Zatvornitskiy, Ivan Medennikov et al.
RWTH ASR Systems for LibriSpeech: Hybrid vs Attention
Christoph Lüscher, Eugen Beck, Kazuki Irie et al.
Salient Speech Representations Based on Cloned Networks
W. Bastiaan Kleijn, Felicia S.C. Lim, Michael Chinen et al.
Sampling from Stochastic Finite Automata with Applications to CTC Decoding
Martin Jansche, Alexander Gutkin
SANTLR: Speech Annotation Toolkit for Low Resource Languages
Xinjian Li, Zhong Zhou, Siddharth Dalmia et al.
Say What? A Dataset for Exploring the Error Patterns That Two ASR Engines Make
Meredith Moore, Michael Saxon, Hemanth Venkateswara et al.
Scalable Multi Corpora Neural Language Models for ASR
Anirudh Raju, Denis Filimonov, Gautam Tiwari et al.
Selection and Training Schemes for Improving TTS Voice Built on Found Data
F.-Y. Kuo, I.C. Ouyang, S. Aryal et al.
Self-Attention for Speech Emotion Recognition
Lorenzo Tarantino, Philip N. Garner, Alexandros Lazaridis
Self Attention in Variational Sequential Learning for Summarization
Jen-Tzung Chien, Chun-Wei Wang
Self-Attention Transducers for End-to-End Speech Recognition
Zhengkun Tian, Jiangyan Yi, Jianhua Tao et al.
Self-Imitating Feedback Generation Using GAN for Computer-Assisted Pronunciation Training
Seung Hee Yang, Minhwa Chung
Self Multi-Head Attention for Speaker Recognition
Miquel India, Pooyan Safari, Javier Hernando
Self-Supervised Speaker Embeddings
Themos Stafylakis, Johan Rohdin, Oldřich Plchot et al.
Self-Teaching Networks
Liang Lu, Eric Sun, Yifan Gong