Papers
Ultra Dual-Path Compression For Joint Echo Cancellation And Noise Suppression
Hangting Chen, Jianwei Yu, Yi Luo et al.
Uncertainty Estimation for Connectionist Temporal Classification Based Automatic Speech Recognition
Lars Rumberg, Christopher Gebauer, Hanna Ehlert et al.
Understanding Disrupted Sentences Using Underspecified Abstract Meaning Representation
Angus Addlesee, Marco Damonte
Understanding Spoken Language Development of Children with ASD Using Pre-trained Speech Embeddings
Anfeng Xu, Rajat Hebbar, Rimita Lahiri et al.
UnDiff: Unsupervised Voice Restoration with Unconditional Diffusion Model
Anastasiia Iashchenko, Pavel Andreev, Ivan Shchekotov et al.
Unified Modeling of Multi-Talker Overlapped Speech Recognition and Diarization with a Sidecar Separator
Lingwei Meng, Jiawen Kang, Mingyu Cui et al.
UniFLG: Unified Facial Landmark Generator from Text or Speech
Kentaro Mitsui, Yukiya Hono, Kei Sawada
UniSplice: Universal Cross-Lingual Data Splicing for Low-Resource ASR
Wei Wang, Yanmin Qian
UnitSpeech: Speaker-adaptive Speech Synthesis with Untranscribed Data
Heeseung Kim, Sungwon Kim, Jiheum Yeom et al.
Universal Automatic Phonetic Transcription into the International Phonetic Alphabet
Chihiro Taguchi, Yusuke Sakai, Parisa Haghani et al.
UnSE: Unsupervised Speech Enhancement Using Optimal Transport
Wenbin Jiang, Fei Wen, Yifan Zhang et al.
Unsupervised Active Learning: Optimizing Labeling Cost-Effectiveness for Automatic Speech Recognition
Zhisheng Zheng, Ziyang Ma, Yu Wang et al.
Unsupervised Adaptation with Quality-Aware Masking to Improve Target-Speaker Voice Activity Detection for Speaker Diarization
Shutong Niu, Jun Du, Maokui He et al.
Unsupervised Auditory and Semantic Entrainment Models with Deep Neural Networks
Jay Kejriwal, Štefan Beňuš, Lina M. Rojas-Barahona
Unsupervised Code-switched Text Generation from Parallel Text
Jie Chi, Brian Lu, Jason Eisner et al.
Unsupervised Dialogue Topic Segmentation in Hyperdimensional Space
Seongmin Park, Jinkyu Seo, Jihwa Lee
Unsupervised Learning of Discrete Latent Representations with Data-Adaptive Dimensionality from Continuous Speech Streams
Shun Takahashi, Sakriani Sakti
Unsupervised Out-of-Distribution Dialect Detection with Mahalanobis Distance
Sourya Dipta Das, Yash Vadi, Abhishek Unnam et al.
Unsupervised speech enhancement with deep dynamical generative speech and noise models
Xiaoyu Lin, Simon Leglaive, Laurent Girin et al.
Unsupervised Transfer Components Learning for Cross-Domain Speech Emotion Recognition
Shenjie Jiang, Peng Song, Shaokai Li et al.
Use of Speech Impairment Severity for Dysarthric Speech Recognition
Mengzhe Geng, Zengrui Jin, Tianzi Wang et al.
Using Commercial ASR Solutions to Assess Reading Skills in Children: A Case Report
Timothy Piton, Enno Hermann, Angela Pasqualotto et al.
Using Semi-supervised Learning for Monaural Time-domain Speech Separation with a Self-supervised Learning-based SI-SNR Estimator
Shaoxiang Dang, Tetsuya Matsumoto, Yoshinori Takeuchi et al.
Using speech synthesis to explain automatic speaker recognition: a new application of synthetic speech
Georgina Brown, Christin Kirchhübel, Ramiz Cuthbert