Papers
Towards a Quantitative Analysis of Coarticulation with a Phoneme-to-Articulatory Model
Chaofei Fan, Jaimie M. Henderson, Chris Manning et al.
Towards Audio Codec-based Speech Separation
Jia Qi Yip, Shengkui Zhao, Dianwen Ng et al.
Towards Classifying Mother Tongue from Infant Cries - Findings Substantiating Prenatal Learning Theory
Tim Polzehl, Tim Herzig, Friedrich Wicke et al.
Towards Effective and Efficient Non-autoregressive Decoding Using Block-based Attention Mask
Tianzi Wang, Xurong Xie, Zhaoqing Li et al.
Towards EMG-to-Speech with Necklace Form Factor
Peter Wu, Ryan Kaveh, Raghav Nautiyal et al.
Towards End-to-End Unified Recognition for Mandarin and Cantonese
Meiling Chen, Pengjie Liu, Heng Yang et al.
Towards Explainable Monaural Speaker Separation with Auditory-based Training
Hassan Taherian, Vahid Ahmadi Kalkhorani, Ashutosh Pandey et al.
Towards Expressive Zero-Shot Speech Synthesis with Hierarchical Prosody Modeling
Yuepeng Jiang, Tao Li, Fengyu Yang et al.
Towards generalisable and calibrated audio deepfake detection with self-supervised representations
Octavian Pascu, Adriana Stan, Dan Oneata et al.
Towards Improving NAM-to-Speech Synthesis Intelligibility using Self-Supervised Speech Models
Neil Shah, Shirish Karande, Vineet Gandhi
Towards Intelligent Speech Assistants in Operating Rooms: A Multimodal Model for Surgical Workflow Analysis
Kubilay Can Demir, Belén Lojo Rodríguez, Tobias Weise et al.
Towards interfacing large language models with ASR systems using confidence measures and prompting
Maryam Naderi, Enno Hermann, Alexandre Nanchen et al.
Towards measuring fairness in speech recognition: Fair-Speech dataset
Irina-Elena Veliche, Zhuangqun Huang, Vineeth Ayyat Kochaniyan et al.
Towards Multilingual Audio-Visual Question Answering
Orchid Chetia Phukan, Priyabrata Mallick, Swarup Ranjan Behera et al.
Towards Naturalistic Voice Conversion: NaturalVoices Dataset with an Automatic Processing Pipeline
Ali N. Salman, Zongyang Du, Shreeram Suresh Chandra et al.
Towards objective and interpretable speech disorder assessment: a comparative analysis of CNN and transformer-based models
Malo Maisonneuve, Corinne Fredouille, Muriel Lalain et al.
Towards Realistic Emotional Voice Conversion using Controllable Emotional Intensity
Tianhua Qi, Shiyan Wang, Cheng Lu et al.
Towards realtime co-speech gestures synthesis using STARGATE
Louis Abel, Vincent Colotte, Slim Ouni
Towards Rehearsal-Free Multilingual ASR: A LoRA-based Case Study on Whisper
Tianyi Xu, Kaixun Huang, Pengcheng Guo et al.
Towards Responsible Speech Processing
Isabel Trancoso
Towards Robust Few-shot Class Incremental Learning in Audio Classification using Contrastive Representation
Riyansha Singh, Parinita Nema, Vinod K Kurmi
Towards Scalable Remote Assessment of Mild Cognitive Impairment Via Multimodal Dialog
Oliver Roesler, Jackson Liscombe, Michael Neumann et al.
Towards Self-Attention Understanding for Automatic Articulatory Processes Analysis in Cleft Lip and Palate Speech
Ilja Baumann, Dominik Wagner, Maria Schuster et al.
Towards Speech Classification from Acoustic and Vocal Tract data in Real-time MRI
Yaoyao Yue, Michael Proctor, Luping Zhou et al.
Towards Speech-to-Pictograms Translation
Cécile Macaire, Chloé Dion, Didier Schwab et al.