Papers
Unsupervised Cross-Domain Singing Voice Conversion
Adam Polyak, Lior Wolf, Yossi Adi et al.
Unsupervised Discovery of Recurring Speech Patterns Using Probabilistic Adaptive Metrics
Okko Räsänen, María Andrea Cruz Blandón
Unsupervised Domain Adaptation for Dialogue Sequence Labeling Based on Hierarchical Adversarial Training
Shota Orihashi, Mana Ihori, Tomohiro Tanaka et al.
Unsupervised Domain Adaptation Under Label Space Mismatch for Speech Classification
Akhil Mathur, Nadia Berthouze, Nicholas D. Lane
Unsupervised Learning for Sequence-to-Sequence Text-to-Speech for Low-Resource Languages
Haitong Zhang, Yue Lin
Unsupervised Methods for Evaluating Speech Representations
Michael Gump, Wei-Ning Hsu, James Glass
Unsupervised Regularization-Based Adaptive Training for Speech Recognition
Fenglin Ding, Wu Guo, Bin Gu et al.
Unsupervised Robust Speech Enhancement Based on Alpha-Stable Fast Multichannel Nonnegative Matrix Factorization
Mathieu Fontaine, Kouhei Sekiguchi, Aditya Arie Nugraha et al.
Unsupervised Subword Modeling Using Autoregressive Pretraining and Cross-Lingual Phone-Aware Modeling
Siyuan Feng, Odette Scharenborg
Unsupervised Training of Siamese Networks for Speaker Verification
Umair Khan, Javier Hernando
Unsupervised vs. Transfer Learning for Multimodal One-Shot Matching of Speech and Images
Leanne Nortje, Herman Kamper
UNSW System Description for the Shared Task on Automatic Speech Recognition for Non-Native Children’s Speech
Mostafa Shahin, Renée Lu, Julien Epps et al.
Using Cyclic Noise as the Source Signal for Neural Source-Filter-Based Speech Waveform Model
Xin Wang, Junichi Yamagishi
Using Silence MR Image to Synthesise Dynamic MRI Vocal Tract Data of CV
Ioannis K. Douros, Ajinkya Kulkarni, Chrysanthi Dourou et al.
Using Speaker-Aligned Graph Memory Block in Multimodally Attentive Emotion Recognition Network
Jeng-Lin Li, Chi-Chun Lee
Using Speech Enhancement Preprocessing for Speech Emotion Recognition in Realistic Noisy Conditions
Hengshun Zhou, Jun Du, Yan-Hui Tu et al.
Using State of the Art Speaker Recognition and Natural Language Processing Technologies to Detect Alzheimer’s Disease and Assess its Severity
Raghavendra Pappagari, Jaejin Cho, Laureano Moro-Velázquez et al.
Utterance Confidence Measure for End-to-End Speech Recognition with Applications to Distributed Speech Recognition Scenarios
Ankur Kumar, Sachin Singh, Dhananjaya Gowda et al.
Utterance Invariant Training for Hybrid Two-Pass End-to-End Speech Recognition
Dhananjaya Gowda, Ankur Kumar, Kwangyoun Kim et al.
Utterance-Wise Meeting Transcription System Using Asynchronous Distributed Microphones
Shota Horiguchi, Yusuke Fujita, Kenji Nagamatsu
Variable Frame Rate-Based Data Augmentation to Handle Speaking-Style Variability for Automatic Speaker Verification
Amber Afshan, Jinxi Guo, Soo Jin Park et al.
VCTUBE : A Library for Automatic Speech Data Annotation
Seong Choi, Seunghoon Jeong, Jeewoo Yoon et al.
Vector-Based Attentive Pooling for Text-Independent Speaker Verification
Yanfeng Wu, Chenkai Guo, Hongcan Gao et al.