Papers
Stochastic Curiosity Exploration for Dialogue Systems
Jen-Tzung Chien, Po-Chien Hsu
Stochastic Talking Face Generation Using Latent Distribution Matching
Ravindra Yadav, Ashish Sardana, Vinay P. Namboodiri et al.
StoRIR: Stochastic Room Impulse Response Generation for Audio Data Augmentation
Piotr Masztalski, Mateusz Matuszewski, Karol Piaskowski et al.
Strategies for End-to-End Text-Independent Speaker Verification
Weiwei Lin, Man-Wai Mak, Jen-Tzung Chien
StrawNet: Self-Training WaveNet for TTS in Low-Data Regimes
Manish Sharma, Tom Kenter, Rob Clark
Streaming Chunk-Aware Multihead Attention for Online End-to-End Speech Recognition
Shiliang Zhang, Zhifu Gao, Haoneng Luo et al.
Streaming Keyword Spotting on Mobile Devices
Oleg Rybakov, Natasha Kononenko, Niranjan Subrahmanya et al.
Streaming On-Device End-to-End ASR System for Privacy-Sensitive Voice-Typing
Abhinav Garg, Gowtham P. Vadisetti, Dhananjaya Gowda et al.
Streaming Transformer-Based Acoustic Models Using Self-Attention with Augmented Memory
Chunyang Wu, Yongqiang Wang, Yangyang Shi et al.
Style Attuned Pre-Training and Parameter Efficient Fine-Tuning for Spoken Language Understanding
Jin Cao, Jun Wang, Wael Hamza et al.
Style Variation as a Vantage Point for Code-Switching
Khyathi Raghavi Chandu, Alan W. Black
Subband Kalman Filtering with DNN Estimated Parameters for Speech Enhancement
Hongjiang Yu, Wei-Ping Zhu, Benoit Champagne
Sub-Band Knowledge Distillation Framework for Speech Enhancement
Xiang Hao, Shixue Wen, Xiangdong Su et al.
Subjective Quality Evaluation of Speech Signals Transmitted via BPL-PLC Wired System
Przemyslaw Falkowski-Gilski, Grzegorz Debita, Marcin Habrych et al.
Subword Regularization: An Analysis of Scalability and Generalization for End-to-End Automatic Speech Recognition
Egor Lakomkin, Jahn Heymann, Ilya Sklyar et al.
Sum-Product Networks for Robust Automatic Speaker Identification
Aaron Nicolson, Kuldip K. Paliwal
Supervised Domain Adaptation for Text-Independent Speaker Verification Using Limited Data
Seyyed Saeed Sarfjoo, Srikanth Madikeri, Petr Motlicek et al.
Surfboard: Audio Feature Extraction for Modern Machine Learning
Raphael Lenain, Jack Weston, Abhishek Shivkumar et al.
Surgical Mask Detection with Convolutional Neural Networks and Data Augmentations on Spectrograms
Steffen Illium, Robert Müller, Andreas Sedlmeier et al.
Surgical Mask Detection with Deep Recurrent Phonetic Models
Philipp Klumpp, Tomás Arias-Vergara, Juan Camilo Vásquez-Correa et al.
Tackling the ADReSS Challenge: A Multimodal Approach to the Automated Recognition of Alzheimer’s Dementia
Matej Martinc, Senja Pollak
Targeted Content Feedback in Spoken Language Learning and Assessment
Xinhao Wang, Klaus Zechner, Christopher Hamill
Target-Speaker Voice Activity Detection: A Novel Approach for Multi-Speaker Diarization in a Dinner Party Scenario
Ivan Medennikov, Maxim Korenevsky, Tatiana Prisyach et al.
Task-Oriented Dialog Generation with Enhanced Entity Representation
Zhenhao He, Jiachun Wang, Jian Chen