Papers
Self-Training for End-to-End Speech Translation
Juan Pino, Qiantong Xu, Xutai Ma et al.
Semantic Complexity in End-to-End Spoken Language Understanding
Joseph P. McKenna, Samridhi Choudhary, Michael Saxon et al.
Semantic Mask for Transformer Based End-to-End Speech Recognition
Chengyi Wang, Yu Wu, Yujiao Du et al.
Semi-Supervised ASR by End-to-End Self-Training
Yang Chen, Weiran Wang, Chao Wang
Semi-Supervised End-to-End ASR via Teacher-Student Learning with Conditional Posterior Distribution
Zi-qiang Zhang, Yan Song, Jian-shu Zhang et al.
Semi-Supervised Learning for Character Expression of Spoken Dialogue Systems
Kenta Yamamoto, Koji Inoue, Tatsuya Kawahara
Semi-Supervised Learning for Multi-Speaker Text-to-Speech Synthesis Using Discrete Speech Representation
Tao Tu, Yuan-Jui Chen, Alexander H. Liu et al.
Semi-Supervised Learning with Data Augmentation for End-to-End ASR
Felix Weninger, Franco Mana, Roberto Gemello et al.
Sentence Level Estimation of Psycholinguistic Norms Using Joint Multidimensional Annotations
Anil Ramakrishna, Shrikanth Narayanan
Separating Varying Numbers of Sources with Auxiliary Autoencoding Loss
Yi Luo, Nima Mesgarani
Sequence-Level Self-Learning with Multiple Hypotheses
Kenichi Kumatani, Dimitrios Dimitriadis, Yashesh Gaur et al.
Sequence-to-Sequence Articulatory Inversion Through Time Convolution of Sub-Band Frequency Signals
Abdolreza Sabzi Shahrebabaki, Sabato Marco Siniscalchi, Giampiero Salvi et al.
Serialized Output Training for End-to-End Overlapped Speech Recognition
Naoyuki Kanda, Yashesh Gaur, Xiaofei Wang et al.
SERIL: Noise Adaptive Speech Enhancement Using Regularization-Based Incremental Learning
Chi-Chang Lee, Yu-Chen Lin, Hsuan-Tien Lin et al.
Shadowability Annotation with Fine Granularity on L2 Utterances and its Improvement with Native Listeners’ Script-Shadowing
Zhenchao Lin, Ryo Takashima, Daisuke Saito et al.
Should we Hard-Code the Recurrence Concept or Learn it Instead? Exploring the Transformer Architecture for Audio-Visual Speech Recognition
George Sterpu, Christian Saam, Naomi Harte
Shouted Speech Compensation for Speaker Verification Robust to Vocal Effort Conditions
Santi Prieto, Alfonso Ortega, Iván López-Espejo et al.
Siamese Convolutional Neural Network Using Gaussian Probability Feature for Spoofing Speech Detection
Zhenchun Lei, Yingen Yang, Changhong Liu et al.
Siamese X-Vector Reconstruction for Domain Adapted Speaker Recognition
Shai Rozenberg, Hagai Aronowitz, Ron Hoory
Simultaneous Conversion of Speaker Identity and Emotion Based on Multiple-Domain Adaptive RBM
Takuya Kishida, Shin Tsukamoto, Toru Nakashika
Singing Synthesis: With a Little Help from my Attention
Orazio Angelini, Alexis Moinet, Kayoko Yanagisawa et al.
Singing Voice Extraction with Attention-Based Spectrograms Fusion
Hao Shi, Longbiao Wang, Sheng Li et al.