Papers
8,761 papers found
Unsupervised Improved MVDR Beamforming for Sound Enhancement
Jacob Kealey, John R. Hershey, François Grondin
Unsupervised Online Continual Learning for Automatic Speech Recognition
Steven Vander Eeckt, Hugo Van hamme
Unveiling Biases while Embracing Sustainability: Assessing the Dual Challenges of Automatic Speech Recognition Systems
Ajinkya Kulkarni, Atharva Kulkarni, Miguel Couceiro et al.
Urdu Alternative Questions: A Hat Pattern
Benazir Mumtaz, Miriam Butt
URGENT Challenge: Universality, Robustness, and Generalizability For Speech Enhancement
Wangyou Zhang, Robin Scheibler, Kohei Saijo et al.
USD-AC: Unsupervised Speech Disentanglement for Accent Conversion
Jen-Hung Huang, Wei-Tsung Lee, Chung-Hsien Wu
Using articulated speech EEG signals for imagined speech decoding
Chris Bras, Tanvina Patel, Odette Scharenborg
Using Large Language Model for End-to-End Chinese ASR and NER
Yuang Li, Jiawei Yu, Min Zhang et al.
Using wav2vec 2.0 for phonetic classification tasks: methodological aspects
Lila Kim, Cédric Gendrot
USM RNN-T model weights binarization
Oleg Rybakov, Dmitriy Serdyuk, Chengjian Zheng
Utilization of Text Data for Response Timing Detection in Attentive Listening
Yu Watanabe, Koichiro Ito, Shigeki Matsubara
UY/CH-CHILD -- A Public Chinese L2 Speech Database of Uyghur Children
Mewlude Nijat, Chen Chen, Dong Wang et al.
Variability of speech timing features across repeated recordings: a comparison of open-source extraction techniques
Judith Dineley, Ewan Carr, Lauren L. White et al.
Variable Segment Length and Domain-Adapted Feature Optimization for Speaker Diarization
Chenyuan Zhang, Linkai Luo, Hong Peng et al.
VECL-TTS: Voice identity and Emotional style controllable Cross-Lingual Text-to-Speech
Ashishkumar Gudmalwar, Nirmesh Shah, Sai Akarsh et al.
Vec-Tok-VC+: Residual-enhanced Robust Zero-shot Voice Conversion with Progressive Constraints in a Dual-mode Training Strategy
Linhan Ma, Xinfa Zhu, Yuanjun Lv et al.
Vision Transformer Segmentation for Visual Bird Sound Denoising
Sahil Kumar, Jialu Li, Youshan Zhang
Visualization for improving foreign language pronunciation
Charlotte Yoder, Karrie Karahalios, Mark Hasegawa-Johnson et al.
Visual scene display application for augmentative and alternative communication
Karthik Venkat Sridaran, Raja Praveen, Reuben T Varghese et al.
VN-SLU: A Vietnamese Spoken Language Understanding Dataset
Tuyen Tran, Khanh Le, Ngoc Dang Nguyen et al.
Voiced and voiceless laterals in Angami
Viyazonuo Terhiija, Priyankoo Sarmah
VoiceDefense: Protecting Automatic Speaker Verification Models Against Black-box Adversarial Attacks
Yip Keng Kan, Ke Xu, Hao Li et al.
Voice Disorder Analysis: a Transformer-based Approach
Alkis Koudounas, Gabriele Ciravegna, Marco Fantini et al.