Papers
Speech Spectrogram Estimation from Intracranial Brain Activity Using a Quantization Approach
Miguel Angrick, Christian Herff, Garett Johnson et al.
Speech to Semantics: Improve ASR and NLU Jointly via All-Neural Interfaces
Milind Rao, Anirudh Raju, Pranav Dheram et al.
Speech-to-Singing Conversion Based on Boundary Equilibrium GAN
Da-Yi Wu, Yi-Hsuan Yang
Speech to Text Adaptation: Towards an Efficient Cross-Modal Distillation
Won Ik Cho, Donghyun Kwak, Ji Won Yoon et al.
Speech Transformer with Speaker Aware Persistent Memory
Yingzhu Zhao, Chongjia Ni, Cheung-Chi Leung et al.
Speech-XLNet: Unsupervised Acoustic Model Pretraining for Self-Attention Networks
Xingchen Song, Guangsen Wang, Yiheng Huang et al.
SpeedySpeech: Efficient Neural Speech Synthesis
Jan Vainer, Ondřej Dušek
SpEx+: A Complete Time Domain Speaker Extraction Network
Meng Ge, Chenglin Xu, Longbiao Wang et al.
Spike-Triggered Non-Autoregressive Transformer for End-to-End Speech Recognition
Zhengkun Tian, Jiangyan Yi, Jianhua Tao et al.
Spoken Content and Voice Factorization for Few-Shot Speaker Adaptation
Tao Wang, Jianhua Tao, Ruibo Fu et al.
Spoken Language ‘Grammatical Error Correction’
Yiting Lu, Mark J.F. Gales, Yu Wang
Spoofing Attack Detection Using the Non-Linear Fusion of Sub-Band Classifiers
Hemlata Tak, Jose Patino, Andreas Nautsch et al.
Spot the Conversation: Speaker Diarisation in the Wild
Joon Son Chung, Jaesung Huh, Arsha Nagrani et al.
Spotting the Traces of Depression in Read Speech: An Approach Based on Computational Paralinguistics and Social Signal Processing
Fuxiang Tao, Anna Esposito, Alessandro Vinciarelli
Squeeze for Sneeze: Compact Neural Networks for Cold and Flu Recognition
Merlin Albes, Zhao Ren, Björn W. Schuller et al.
Stacked 1D Convolutional Networks for End-to-End Small Footprint Voice Trigger Detection
Takuya Higuchi, Mohammad Ghasemzadeh, Kisun You et al.
Staged Knowledge Distillation for End-to-End Dysarthric Speech Recognition and Speech Attribute Transcription
Yuqin Lin, Longbiao Wang, Sheng Li et al.
State Sequence Pooling Training of Acoustic Models for Keyword Spotting
Kuba Łopatka, Tobias Bocklet
Statistical and Neural Network Based Speech Activity Detection in Non-Stationary Acoustic Environments
Jens Heitkaemper, Joerg Schmalenstroeer, Reinhold Haeb-Umbach
Statistical Testing on ASR Performance via Blockwise Bootstrap
Zhe Liu, Fuchun Peng
STC-Innovation Speaker Recognition Systems for Far-Field Speaker Verification Challenge 2020
Aleksei Gusev, Vladimir Volokhov, Alisa Vinogradova et al.
Stochastic Convolutional Recurrent Networks for Language Modeling
Jen-Tzung Chien, Yu-Min Huang
Stochastic Curiosity Exploration for Dialogue Systems
Jen-Tzung Chien, Po-Chien Hsu
Stochastic Talking Face Generation Using Latent Distribution Matching
Ravindra Yadav, Ashish Sardana, Vinay P. Namboodiri et al.
StoRIR: Stochastic Room Impulse Response Generation for Audio Data Augmentation
Piotr Masztalski, Mateusz Matuszewski, Karol Piaskowski et al.