Papers
Semi-Supervised Voice Conversion with Amortized Variational Inference
Cory Stephenson, Gokce Keskin, Anil Thomas et al.
Sentence Prosody and Wh-Indeterminates in Taiwan Mandarin
Yu-Yin Hsu, Anqi Xu
Sequence-to-Sequence Learning via Attention Transfer for Incremental Speech Recognition
Sashi Novitasari, Andros Tjandra, Sakriani Sakti et al.
Sequence-to-Sequence Speech Recognition with Time-Depth Separable Convolutions
Awni Hannun, Ann Lee, Qiantong Xu et al.
Shallow-Fusion End-to-End Contextual Biasing
Ding Zhao, Tara N. Sainath, David Rybach et al.
Shortcut Connections Based Deep Speaker Embeddings for End-to-End Speaker Verification System
Soonshin Seo, Daniel Jun Rim, Minkyu Lim et al.
ShrinkML: End-to-End ASR Model Compression Using Reinforcement Learning
Łukasz Dudziak, Mohamed S. Abdelfattah, Ravichander Vipperla et al.
Simultaneous Denoising and Dereverberation for Low-Latency Applications Using Frame-by-Frame Online Unified Convolutional Beamformer
Tomohiro Nakatani, Keisuke Kinoshita
Simultaneous Detection and Localization of a Wake-Up Word Using Multi-Task Learning of the Duration and Endpoint
Takashi Maekaku, Yusuke Kida, Akihiko Sugiyama
Sincerity in Acted Speech: Presenting the Sincere Apology Corpus and Results
Alice Baird, Eduardo Coutinho, Julia Hirschberg et al.
Singing Voice Synthesis Using Deep Autoregressive Neural Networks for Acoustic Modeling
Yuan-Hao Yi, Yang Ai, Zhen-Hua Ling et al.
Slot Filling with Weighted Multi-Encoders for Out-of-Domain Values
Yuka Kobayashi, Takami Yoshida, Kenji Iwata et al.
SLP-AA: Tools for Sign Language Phonetic and Phonological Research
Roger Yu-Hsiang Lo, Kathleen Currie Hall
Small-Footprint Magic Word Detection Method Using Convolutional LSTM Neural Network
Taiki Yamamoto, Ryota Nishimura, Masayuki Misaki et al.
Sound Privacy: A Conversational Speech Corpus for Quantifying the Experience of Privacy
Pablo Pérez Zarazaga, Sneha Das, Tom Bäckström et al.
Sound Tools eXtended (STx) 5.0 — A Powerful Sound Analysis Tool Optimized for Speech
Anton Noll, Jonathan Stuefer, Nicola Klingler et al.
SparseSpeech: Unsupervised Acoustic Unit Discovery with Memory-Augmented Sequence Autoencoders
Benjamin Milde, Chris Biemann
Spatial and Spectral Fingerprint in the Brain: Speaker Identification from Single Trial MEG Signals
Debadatta Dash, Paul Ferrari, Jun Wang
Spatial Pyramid Encoding with Convex Length Normalization for Text-Independent Speaker Verification
Youngmoon Jung, Younggwan Kim, Hyungjun Lim et al.
Spatial, Temporal and Spectral Multiresolution Analysis for the INTERSPEECH 2019 ComParE Challenge
Marie-José Caraty, Claude Montacié
Spatio-Temporal Attention Pooling for Audio Scene Classification
Huy Phan, Oliver Y. Chén, Lam Pham et al.
Speaker Adaptation for Attention-Based End-to-End Speech Recognition
Zhong Meng, Yashesh Gaur, Jinyu Li et al.
Speaker Adaptation for Lip-Reading Using Visual Identity Vectors
Pujitha Appan Kandala, Abhinav Thanda, Dilip Kumar Margam et al.