Papers
Semi-Supervised Acoustic Model Training for Five-Lingual Code-Switched ASR
Astik Biswas, Emre Yılmaz, Febe de Wet et al.
Semi-Supervised Audio Classification with Consistency-Based Regularization
Kangkang Lu, Chuan-Sheng Foo, Kah Kuan Teh et al.
Semi-Supervised Prosody Modeling Using Deep Gaussian Process Latent Variable Model
Tomoki Koriyama, Takao Kobayashi
Semi-Supervised Sequence-to-Sequence ASR Using Unpaired Speech and Text
Murali Karthick Baskar, Shinji Watanabe, Ramon Astudillo et al.
Semi-Supervised Voice Conversion with Amortized Variational Inference
Cory Stephenson, Gokce Keskin, Anil Thomas et al.
Sentence Prosody and Wh-Indeterminates in Taiwan Mandarin
Yu-Yin Hsu, Anqi Xu
Sequence-to-Sequence Learning via Attention Transfer for Incremental Speech Recognition
Sashi Novitasari, Andros Tjandra, Sakriani Sakti et al.
Sequence-to-Sequence Speech Recognition with Time-Depth Separable Convolutions
Awni Hannun, Ann Lee, Qiantong Xu et al.
Shallow-Fusion End-to-End Contextual Biasing
Ding Zhao, Tara N. Sainath, David Rybach et al.
Shortcut Connections Based Deep Speaker Embeddings for End-to-End Speaker Verification System
Soonshin Seo, Daniel Jun Rim, Minkyu Lim et al.
ShrinkML: End-to-End ASR Model Compression Using Reinforcement Learning
Łukasz Dudziak, Mohamed S. Abdelfattah, Ravichander Vipperla et al.
Simultaneous Denoising and Dereverberation for Low-Latency Applications Using Frame-by-Frame Online Unified Convolutional Beamformer
Tomohiro Nakatani, Keisuke Kinoshita
Simultaneous Detection and Localization of a Wake-Up Word Using Multi-Task Learning of the Duration and Endpoint
Takashi Maekaku, Yusuke Kida, Akihiko Sugiyama
Sincerity in Acted Speech: Presenting the Sincere Apology Corpus and Results
Alice Baird, Eduardo Coutinho, Julia Hirschberg et al.
Singing Voice Synthesis Using Deep Autoregressive Neural Networks for Acoustic Modeling
Yuan-Hao Yi, Yang Ai, Zhen-Hua Ling et al.
Slot Filling with Weighted Multi-Encoders for Out-of-Domain Values
Yuka Kobayashi, Takami Yoshida, Kenji Iwata et al.
SLP-AA: Tools for Sign Language Phonetic and Phonological Research
Roger Yu-Hsiang Lo, Kathleen Currie Hall
Small-Footprint Magic Word Detection Method Using Convolutional LSTM Neural Network
Taiki Yamamoto, Ryota Nishimura, Masayuki Misaki et al.
Sound Privacy: A Conversational Speech Corpus for Quantifying the Experience of Privacy
Pablo Pérez Zarazaga, Sneha Das, Tom Bäckström et al.
Sound Tools eXtended (STx) 5.0 — A Powerful Sound Analysis Tool Optimized for Speech
Anton Noll, Jonathan Stuefer, Nicola Klingler et al.
SparseSpeech: Unsupervised Acoustic Unit Discovery with Memory-Augmented Sequence Autoencoders
Benjamin Milde, Chris Biemann
Spatial and Spectral Fingerprint in the Brain: Speaker Identification from Single Trial MEG Signals
Debadatta Dash, Paul Ferrari, Jun Wang
Spatial Pyramid Encoding with Convex Length Normalization for Text-Independent Speaker Verification
Youngmoon Jung, Younggwan Kim, Hyungjun Lim et al.