Papers
UniKW-AT: Unified Keyword Spotting and Audio Tagging
Heinrich Dinkel, Yongqing Wang, Zhiyong Yan et al.
Unsupervised Acoustic-to-Articulatory Inversion with Variable Vocal Tract Anatomy
Yifan Sun, Qinlong Huang, Xihong Wu
Unsupervised Data Selection via Discrete Speech Representation for ASR
Zhiyun Lu, Yongqiang Wang, Yu Zhang et al.
Unsupervised domain adaptation for speech recognition with unsupervised error correction
Long Mai, Julie Carson-Berndsen
Unsupervised Inference of Physiologically Meaningful Articulatory Trajectories with VocalTractLab
Yifan Sun, Qinlong Huang, Xihong Wu
Unsupervised Instance Discriminative Learning for Depression Detection from Speech Signals
Jinhan Wang, Vijay Ravi, Jonathan Flint et al.
Unsupervised Speaker Diarization that is Agnostic to Language, Overlap-Aware, and Tuning Free
Md Iftekhar Tanveer, Diego Casabuena, Jussi Karlgren et al.
Unsupervised Symbolic Music Segmentation using Ensemble Temporal Prediction Errors
Shahaf Bassan, Yossi Adi, Jeffrey Rosenschein
Unsupervised Text-to-Speech Synthesis by Unsupervised Automatic Speech Recognition
Junrui Ni, Liming Wang, Heting Gao et al.
Unsupervised Training of Sequential Neural Beamformer Using Coarsely-separated and Non-separated Signals
Kohei Saijo, Tetsuji Ogawa
Unsupervised Uncertainty Measures of Automatic Speech Recognition for Non-intrusive Speech Intelligibility Prediction
Zehai Tu, Ning Ma, Jon Barker
Unsupervised Voice Activity Detection by Modeling Source and System Information using Zero Frequency Filtering
Eklavya Sarkar, RaviShankar Prasad, Mathew Magimai Doss
Unsupervised Word Segmentation using K Nearest Neighbors
Tzeviya Fuchs, Yedid Hoshen, Yossi Keshet
Updating Only Encoders Prevents Catastrophic Forgetting of End-to-End ASR Models
Yuki Takashima, Shota Horiguchi, Shinji Watanabe et al.
Use of Nods Less Synchronized with Turn-Taking and Prosody During Conversations in Adults with Autism
Keiko Ochi, Nobutaka Ono, Keiho Owada et al.
Use of prosodic and lexical cues for disambiguating wh-words in Korean
Jieun Song, Hae-Sung Jeon, Jieun Kiaer
User-Level Differential Privacy against Attribute Inference Attack of Speech Emotion Recognition on Federated Learning
Tiantian Feng, Raghuveer Peri, Shrikanth Narayanan
UserLibri: A Dataset for ASR Personalization Using Only Text
Theresa Breiner, Swaroop Ramaswamy, Ehsan Variani et al.
Using cross-model learnings for the Gram Vaani ASR Challenge 2022
Tanvina Patel, Odette Scharenborg
Using Fluency Representation Learned from Sequential Raw Features for Improving Non-native Fluency Scoring
Kaiqi Fu, Shaojun Gao, Xiaohai Tian et al.
Using Rater and System Metadata to Explain Variance in the VoiceMOS Challenge 2022 Dataset
Michael Chinen, Jan Skoglund, Chandan K. A. Reddy et al.
UTMOS: UTokyo-SaruLab System for VoiceMOS Challenge 2022
Takaaki Saeki, Detai Xin, Wataru Nakata et al.
Utterance-by-utterance overlap-aware neural diarization with Graph-PIT
Keisuke Kinoshita, Thilo von Neumann, Marc Delcroix et al.
Vaccinating SER to Neutralize Adversarial Attacks with Self-Supervised Augmentation Strategy
Bo-Hao Su, Chi-Chun Lee