Papers
SdSVC Challenge 2021: Tips and Tricks to Boost the Short-Duration Speaker Verification System Performance
Aleksei Gusev, Alisa Vinogradova, Sergey Novoselov et al.
SE-Conformer: Time-Domain Speech Enhancement Using Conformer
Eesung Kim, Hyeji Seo
“See what I mean, huh?” Evaluating Visual Inspection of F0Tracking in Nasal Grunts
Aurélie Chlébowski, Nicolas Ballier
Segmental Contrastive Predictive Coding for Unsupervised Word Segmentation
Saurabhchand Bhati, Jesús Villalba, Piotr Żelasko et al.
Segment and Tone Production in Continuous Speech of Hearing and Hearing-Impaired Children
Shu-Chuan Tseng, Yi-Fen Liu
Self-Adaptive Distillation for Multilingual Speech Recognition: Leveraging Student Independence
Isabel Leal, Neeraj Gaur, Parisa Haghani et al.
Self-Attention Channel Combinator Frontend for End-to-End Multichannel Far-Field Speech Recognition
Rong Gong, Carl Quillen, Dushyant Sharma et al.
Self-Paced Ensemble Learning for Speech and Audio Classification
Nicolae-Cătălin Ristea, Radu Tudor Ionescu
Self-Supervised Dialogue Learning for Spoken Conversational Question Answering
Nuo Chen, Chenyu You, Yuexian Zou
Self-Supervised End-to-End ASR for Low Resource L2 Swedish
Ragheb Al-Ghezi, Yaroslav Getman, Aku Rouhe et al.
Self-Supervised Learning Based Phone-Fortified Speech Enhancement
Yuanhang Qiu, Ruili Wang, Satwinder Singh et al.
Self-Supervised Phonotactic Representations for Language Identification
G. Ramesh, C. Shiva Kumar, K. Sri Rama Murty
Semantic Data Augmentation for End-to-End Mandarin Speech Recognition
Jianwei Sun, Zhiyuan Tang, Hengxin Yin et al.
Semantic Distance: A New Metric for ASR Performance Analysis Towards Spoken Language Understanding
Suyoun Kim, Abhinav Arora, Duc Le et al.
Semantic Sentence Similarity: Size does not Always Matter
Danny Merkx, Stefan L. Frank, Mirjam Ernestus
Semantic Transportation Prototypical Network for Few-Shot Intent Detection
Weiyuan Xu, Peilin Zhou, Chenyu You et al.
Semi-Supervised Training with Pseudo-Labeling for End-To-End Neural Diarization
Yuki Takashima, Yusuke Fujita, Shota Horiguchi et al.
Semi-Supervision in ASR: Sequential MixMatch and Factorized TTS-Based Augmentation
Zhehuai Chen, Andrew Rosenberg, Yu Zhang et al.
Separation of Emotional and Reconstruction Embeddings on Ladder Network to Improve Speech Emotion Recognition Robustness in Noisy Conditions
Seong-Gyun Leem, Daniel Fulford, Jukka-Pekka Onnela et al.
Sequence-Level Confidence Classifier for ASR Utterance Accuracy and Application to Acoustic Models
Amber Afshan, Kshitiz Kumar, Jian Wu
Sequence-to-Sequence Learning for Deep Gaussian Process Based Speech Synthesis Using Self-Attention GP Layer
Taiki Nakamura, Tomoki Koriyama, Hiroshi Saruwatari
Sequential End-to-End Intent and Slot Label Classification and Localization
Yiran Cao, Nihal Potdar, Anderson R. Avila
Serialized Multi-Layer Multi-Head Attention for Neural Speaker Embedding
Hongning Zhu, Kong Aik Lee, Haizhou Li
Shallow Convolution-Augmented Transformer with Differentiable Neural Computer for Low-Complexity Classification of Variable-Length Acoustic Scene
Soonshin Seo, Donghyun Lee, Ji-Hwan Kim