Papers
Transfer Learning and Data Augmentation Techniques to the COVID-19 Identification Tasks in ComParE 2021
Edresson Casanova, Arnaldo Candido Jr., Ricardo Corso Fernandes Jr. et al.
Transfer Learning-Based Cough Representations for Automatic Detection of COVID-19
Rubén Solera-Ureña, Catarina Botelho, Francisco Teixeira et al.
Transfer Learning for Speech Intelligibility Improvement in Noisy Environments
Ritujoy Biswas, Karan Nathwani, Vinayak Abrol
Transformer-Based Acoustic Modeling for Streaming Speech Synthesis
Chunyang Wu, Zhiping Xiu, Yangyang Shi et al.
Transformer-Based ASR Incorporating Time-Reduction Layer and Fine-Tuning with Self-Knowledge Distillation
Md. Akmal Haidar, Chao Xing, Mehdi Rezagholizadeh
Transformer Based End-to-End Mispronunciation Detection and Diagnosis
Minglin Wu, Kun Li, Wai-Kim Leung et al.
Triple M: A Practical Text-to-Speech Synthesis System with Multi-Guidance Attention and Multi-Band Multi-Time LPCNet
Shilun Lin, Fenglong Xie, Li Meng et al.
Tusom2021: A Phonetically Transcribed Speech Dataset from an Endangered Language for Universal Phone Recognition Experiments
David R. Mortensen, Jordan Picone, Xinjian Li et al.
TVQVC: Transformer Based Vector Quantized Variational Autoencoder with CTC Loss for Voice Conversion
Ziyi Chen, Pengyuan Zhang
Two-Pathway Style Embedding for Arbitrary Voice Conversion
Xuexin Xu, Liang Shi, Jinhui Chen et al.
Ultra Fast Speech Separation Model with Teacher Student Learning
Sanyuan Chen, Yu Wu, Zhuo Chen et al.
Uncertainty-Aware COVID-19 Detection from Imbalanced Sound Data
Tong Xia, Jing Han, Lorena Qendro et al.
Uncovering the Acoustic Cues of COVID-19 Infection
Sriram Ganapathy
Understanding Medical Conversations: Rich Transcription, Confidence Scores & Information Extraction
Hagen Soltau, Mingqiu Wang, Izhak Shafran et al.
Unified Autoregressive Modeling for Joint End-to-End Multi-Talker Overlapped Speech Recognition and Speaker Attribute Estimation
Ryo Masumura, Daiki Okamura, Naoki Makishima et al.
Unified Source-Filter GAN: Unified Source-Filter Network Based On Factorization of Quasi-Periodic Parallel WaveGAN
Reo Yoneyama, Yi-Chiao Wu, Tomoki Toda
UnitNet-Based Hybrid Speech Synthesis
Xiao Zhou, Zhen-Hua Ling, Li-Rong Dai
Universal Speaker Extraction in the Presence and Absence of Target Speakers for Speech of One and Two Talkers
Marvin Borsdorf, Chenglin Xu, Haizhou Li et al.
UnivNet: A Neural Vocoder with Multi-Resolution Spectrogram Discriminators for High-Fidelity Waveform Generation
Won Jang, Dan Lim, Jaesam Yoon et al.
Unsupervised Acoustic Unit Discovery by Leveraging a Language-Independent Subword Discriminative Feature Representation
Siyuan Feng, Piotr Żelasko, Laureano Moro-Velázquez et al.
Unsupervised Bayesian Adaptation of PLDA for Speaker Verification
Bengt J. Borgström
Unsupervised Cross-Lingual Representation Learning for Speech Recognition
Alexis Conneau, Alexei Baevski, Ronan Collobert et al.
Unsupervised Domain Adaptation for Dysarthric Speech Detection via Domain Adversarial Training and Mutual Information Minimization
Disong Wang, Liqun Deng, Yu Ting Yeung et al.
Unsupervised Learning of Disentangled Speech Content and Style Representation
Andros Tjandra, Ruoming Pang, Yu Zhang et al.
Unsupervised Multi-Target Domain Adaptation for Acoustic Scene Classification
Dongchao Yang, Helin Wang, Yuexian Zou