Papers
Semi-Supervised End-to-End Speech Recognition
Shigeki Karita, Shinji Watanabe, Tomoharu Iwata et al.
Semi-supervised Learning for Information Extraction from Dialogue
Anjuli Kannan, Kai Chen, Diana Jaunzeikare et al.
Semi-tied Units for Efficient Gating in LSTM and Highway Networks
Chao Zhang, Philip Woodland
Sensorimotor Response to Tongue Displacement Imagery by Talkers with Parkinson’s Disease
William Katz, Patrick Reidy, Divya Prabhakaran
Sequence-to-sequence Neural Network Model with 2D Attention for Learning Japanese Pitch Accents
Antoine Bruguier, Heiga Zen, Arkady Arkhangorodsky
Should Code-switching Models Be Asymmetric?
Barbara E. Bullock, Gualberto Guzmán, Jacqueline Serigos et al.
Siamese Recurrent Auto-Encoder Representation for Query-by-Example Spoken Term Detection
Ziwei Zhu, Zhiyong Wu, Runnan Li et al.
Single-Channel Dereverberation Using Direct MMSE Optimization and Bidirectional LSTM Networks
Wolfgang Mack, Soumitro Chakrabarty, Fabian-Robert Stöter et al.
Single-channel Late Reverberation Power Spectral Density Estimation Using Denoising Autoencoders
Ina Kodrasi, Hervé Bourlard
Single-channel Speech Dereverberation via Generative Adversarial Training
Chenxing Li, Tieqiang Wang, Shuang Xu et al.
Slot Filling with Delexicalized Sentence Generation
Youhyun Shin, Kang Min Yoo, Sang-goo Lee
Spanish Statistical Parametric Speech Synthesis Using a Neural Vocoder
Antonio Bonafonte, Santiago Pascual, Georgina Dorca
Speaker Activity Detection and Minimum Variance Beamforming for Source Separation
Enea Ceolini, Jithendar Anumula, Adrian Huber et al.
Speaker Adaptation and Adaptive Training for Jointly Optimised Tandem Systems
Yu Wang, Chao Zhang, Mark Gales et al.
Speaker Adaptive Audio-Visual Fusion for the Open-Vocabulary Section of AVICAR
Leda Sari, Mark Hasegawa-Johnson, Kumaran S et al.
Speaker Adaptive Training and Mixup Regularization for Neural Network Acoustic Models in Automatic Speech Recognition
Natalia Tomashenko, Yuri Khokhlov, Yannick Estève
Speaker Diarization with Enhancing Speech for the First DIHARD Challenge
Lei Sun, Jun Du, Chao Jiang et al.
Speaker Embedding Extraction with Phonetic Information
Yi Liu, Liang He, Jia Liu et al.
Speaker-independent Raw Waveform Model for Glottal Excitation
Lauri Juvela, Vassilis Tsiaras, Bajibabu Bollepalli et al.
Speaker Recognition with Nonlinear Distortion: Clipping Analysis and Impact
Wei Xia, John H.L. Hansen
Speaker-specific Structure in German Voiceless Stop Voice Onset Times
Marc Antony Hullebus, Stephen Tobin, Adamantios Gafos
Speech2Vec: A Sequence-to-Sequence Framework for Learning Word Embeddings from Speech
Yu-An Chung, James Glass