Papers
Unsupervised Neural-Based Graph Clustering for Variable-Length Speech Representation Discovery of Zero-Resource Languages
Shun Takahashi, Sakriani Sakti, Satoshi Nakamura
Unsupervised Representation Learning for Speech Activity Detection in the Fearless Steps Challenge 2021
Pablo Gimeno, Alfonso Ortega, Antonio Miguel et al.
Unsupervised Training of a DNN-Based Formant Tracker
Jason Lilley, H. Timothy Bunnell
User-Initiated Repetition-Based Recovery in Multi-Utterance Dialogue Systems
Hoang Long Nguyen, Vincent Renkens, Joris Pelemans et al.
Using Games to Augment Corpora for Language Recognition and Confusability
Christopher Cieri, James Fiumara, Jonathan Wright
Using Large Self-Supervised Models for Low-Resource Speech Recognition
Krishna D. N, Pinyi Wang, Bruno Bozza
Using the Outputs of Different Automatic Speech Recognition Paradigms for Acoustic- and BERT-Based Alzheimer’s Dementia Detection Through Spontaneous Speech
Yilin Pan, Bahman Mirheidari, Jennifer M. Harris et al.
Using Transposed Convolution for Articulatory-to-Acoustic Conversion from Real-Time MRI Data
Ryo Tanji, Hidefumi Ohmura, Kouichi Katsurada
Using X-Vectors for Speech Activity Detection in Broadcast Streams
Lukas Mateju, Frantisek Kynych, Petr Cerva et al.
Utilizing Self-Supervised Representations for MOS Prediction
Wei-Cheng Tseng, Chien-yu Huang, Wei-Tsung Kao et al.
VAD-Free Streaming Hybrid CTC/Attention ASR for Unsegmented Recording
Hirofumi Inaguma, Tatsuya Kawahara
VAENAR-TTS: Variational Auto-Encoder Based Non-AutoRegressive Text-to-Speech Synthesis
Hui Lu, Zhiyong Wu, Xixin Wu et al.
Variable Frame Rate Acoustic Models Using Minimum Error Reinforcement Learning
Dongcheng Jiang, Chao Zhang, Philip C. Woodland
Variational Auto-Encoder Based Variability Encoding for Dysarthric Speech Recognition
Xurong Xie, Rukiye Ruzi, Xunying Liu et al.
Variational Information Bottleneck Based Regularization for Speaker Recognition
Dan Wang, Yuanjie Dong, Yaxing Li et al.
Variational Information Bottleneck for Effective Low-Resource Audio Classification
Shijing Si, Jianzong Wang, Huiming Sun et al.
Variation in Perceptual Sensitivity and Compensation for Coarticulation Across Adult and Child Naturally-Produced and TTS Voices
Aleese Block, Michelle Cohn, Georgia Zellou
ViSTAFAE: A Visual Speech-Training Aid with Feedback of Articulatory Efforts
Pramod H. Kachare, Prem C. Pandey, Vishal Mane et al.
Visualizing Classifier Adjacency Relations: A Case Study in Speaker Verification and Voice Anti-Spoofing
Tomi Kinnunen, Andreas Nautsch, Md. Sahidullah et al.
Visual Speech for Obstructive Sleep Apnea Detection
Catarina Botelho, Alberto Abad, Tanja Schultz et al.
Visual Transformers for Primates Classification and Covid Detection
Steffen Illium, Robert Müller, Andreas Sedlmeier et al.
Vocal Harmony Separation Using Time-Domain Neural Networks
Saurjya Sarkar, Emmanouil Benetos, Mark Sandler
VocalTurk: Exploring Feasibility of Crowdsourced Speaker Identification
Susumu Saito, Yuta Ide, Teppei Nakano et al.