Papers
Training Speaker Enrollment Models by Network Optimization
Victoria Mingote, Antonio Miguel, Alfonso Ortega et al.
Transfer Learning Approaches for Streaming End-to-End Speech Recognition System
Vikas Joshi, Rui Zhao, Rupesh R. Mehta et al.
Transfer Learning for Improving Singing-Voice Detection in Polyphonic Instrumental Music
Yuanbo Hou, Frank K. Soong, Jian Luan et al.
Transfer Learning of Articulatory Information Through Phone Information
Abdolreza Sabzi Shahrebabaki, Negar Olfati, Sabato Marco Siniscalchi et al.
Transfer Learning of the Expressivity Using FLOW Metric Learning in Multispeaker Text-to-Speech Synthesis
Ajinkya Kulkarni, Vincent Colotte, Denis Jouvet
Transferring Source Style in Non-Parallel Voice Conversion
Songxiang Liu, Yuewen Cao, Shiyin Kang et al.
Transformer-Based Long-Context End-to-End Speech Recognition
Takaaki Hori, Niko Moritz, Chiori Hori et al.
Transformer VQ-VAE for Unsupervised Unit Discovery and Speech Synthesis: ZeroSpeech 2020 Challenge
Andros Tjandra, Sakriani Sakti, Satoshi Nakamura
Transformer with Bidirectional Decoder for Speech Recognition
Xi Chen, Songyang Zhang, Dandan Song et al.
Transliteration Based Data Augmentation for Training Multilingual ASR Acoustic Models in Low Resource Settings
Samuel Thomas, Kartik Audhkhasi, Brian Kingsbury
TTS Skins: Speaker Conversion via ASR
Adam Polyak, Lior Wolf, Yaniv Taigman
Ultrasound-Based Articulatory-to-Acoustic Mapping with WaveGlow Speech Synthesis
Tamás Gábor Csapó, Csaba Zainkó, László Tóth et al.
Uncertainty-Aware Machine Support for Paper Reviewing on the Interspeech 2019 Submission Corpus
Lukas Stappen, Georgios Rizos, Madina Hasan et al.
UncommonVoice: A Crowdsourced Dataset of Dysphonic Speech
Meredith Moore, Piyush Papreja, Michael Saxon et al.
Unconditional Audio Generation with Generative Adversarial Networks and Cycle Regularization
Jen-Yu Liu, Yu-Hua Chen, Yin-Cheng Yeh et al.
Understanding Racial Disparities in Automatic Speech Recognition: The Case of Habitual “be”
Joshua L. Martin, Kevin Tang
Understanding Self-Attention of Self-Supervised Audio Transformers
Shu-wen Yang, Andy T. Liu, Hung-yi Lee
Understanding the Effect of Voice Quality and Accent on Talker Similarity
Anurag Das, Guanlong Zhao, John Levis et al.
U-Net Based Direct-Path Dominance Test for Robust Direction-of-Arrival Estimation
Hao Wang, Kai Chen, Jing Lu
Universal Adversarial Attacks on Spoken Language Assessment Systems
Vyas Raina, Mark J.F. Gales, Kate M. Knill
Universal Speech Transformer
Yingzhu Zhao, Chongjia Ni, Cheung-Chi Leung et al.
Unsupervised Acoustic Unit Representation Learning for Voice Conversion Using WaveNet Auto-Encoders
Mingjie Chen, Thomas Hain
Unsupervised Audio Source Separation Using Generative Priors
Vivek Narayanaswamy, Jayaraman J. Thiagarajan, Rushil Anirudh et al.