Papers
Robust Beam Search for Encoder-Decoder Attention Based Speech Recognition Without Length Bias
Wei Zhou, Ralf Schlüter, Hermann Ney
Robust Pitch Regression with Voiced/Unvoiced Classification in Nonstationary Noise Environments
Dung N. Tran, Uros Batricevic, Kazuhito Koishida
Robust Raw Waveform Speech Recognition Using Relevance Weighted Representations
Purvi Agrawal, Sriram Ganapathy
Robust Text-Dependent Speaker Verification via Character-Level Information Preservation for the SdSV Challenge 2020
Sung Hwan Mun, Woo Hyun Kang, Min Hyun Han et al.
S2IGAN: Speech-to-Image Generation via Adversarial Learning
Xinsheng Wang, Tingting Qiao, Jihua Zhu et al.
SAN-M: Memory Equipped Self-Attention for End-to-End Speech Recognition
Zhifu Gao, Shiliang Zhang, Ming Lei et al.
SCADA: Stochastic, Consistent and Adversarial Data Augmentation to Improve ASR
Gary Wang, Andrew Rosenberg, Zhehuai Chen et al.
Scaling Processes of Clause Chains in Pitjantjatjara
Rebecca Defina, Catalina Torres, Hywel Stoakes
Scaling Up Online Speech Recognition Using ConvNets
Vineel Pratap, Qiantong Xu, Jacob Kahn et al.
SdSV Challenge 2020: Large-Scale Evaluation of Short-Duration Speaker Verification
Hossein Zeinali, Kong Aik Lee, Jahangir Alam et al.
SEANet: A Multi-Modal Speech Enhancement Network
Marco Tagliasacchi, Yunpeng Li, Karolis Misiunas et al.
Secondary Phonetic Cues in the Production of the Nasal Short-a System in California English
Georgia Zellou, Rebecca Scarborough, Renee Kemp
Seeing Voices and Hearing Voices: Learning Discriminative Embeddings Using Cross-Modal Self-Supervision
Soo-Whan Chung, Hong-Goo Kang, Joon Son Chung
Segment Aggregation for Short Utterances Speaker Verification Using Raw Waveforms
Seung-bin Kim, Jee-weon Jung, Hye-jin Shim et al.
Segment-Level Effects of Gender, Nationality and Emotion Information on Text-Independent Speaker Verification
Kai Li, Masato Akagi, Yibo Wu et al.
Self-and-Mixed Attention Decoder with Deep Acoustic Structure for Transformer-Based LVCSR
Xinyuan Zhou, Grandee Lee, Emre Yılmaz et al.
Self-Attention Encoding and Pooling for Speaker Recognition
Pooyan Safari, Miquel India, Javier Hernando
Self-Attentive Similarity Measurement Strategies in Speaker Diarization
Qingjian Lin, Yu Hou, Ming Li
Self-Distillation for Improving CTC-Transformer-Based ASR Systems
Takafumi Moriya, Tsubasa Ochiai, Shigeki Karita et al.
Self-Expressing Autoencoders for Unsupervised Spoken Term Discovery
Saurabhchand Bhati, Jesús Villalba, Piotr Żelasko et al.
Self-Supervised Adversarial Multi-Task Learning for Vocoder-Based Monaural Speech Enhancement
Zhihao Du, Ming Lei, Jiqing Han et al.
Self-Supervised Contrastive Learning for Unsupervised Phoneme Segmentation
Felix Kreuk, Joseph Keshet, Yossi Adi
Self-Supervised Pre-Training with Acoustic Configurations for Replay Spoofing Detection
Hye-jin Shim, Hee-Soo Heo, Jee-weon Jung et al.
Self-Supervised Representations Improve End-to-End Speech Translation
Anne Wu, Changhan Wang, Juan Pino et al.
Self-Supervised Spoofing Audio Detection Scheme
Ziyue Jiang, Hongcheng Zhu, Li Peng et al.