Papers
8,761 papers found
Exploiting Foundation Models and Speech Enhancement for Parkinson's Disease Detection from Speech in Real-World Operative Conditions
Moreno La Quatra, Maria Francesca Turco, Torbjørn Svendsen et al.
Exploiting Wavelet Scattering Transform for an Unsupervised Speaker Diarization in Deep Neural Network Framework
Arunav Arya, Murtiza Ali, Karan Nathwani
Exploring adaptation techniques of large speech foundation models for low-resource ASR: a case study on Northern Sámi
Yaroslav Getman, Tamas Grosz, Katri Hiovain-Asikainen et al.
Exploring compressibility of transformer based text-to-music (TTM) models
Vasileios Moschopoulos, Thanasis Kotsiopoulos, Pablo Peso Parada et al.
Exploring Energy-Based Models for Out-of-Distribution Detection in Dialect Identification
Yaqian Hao, Chenguang Hu, Yingying Gao et al.
Exploring Gender-Specific Speech Patterns in Automatic Suicide Risk Assessment
Maurice Gerczuk, Shahin Amiriparian, Justina Lutz et al.
Exploring Impact of Pausing and Lexical Stress Patterns on L2 English Comprehensibility in Real Time
Sylvain Coulange, Tsuneo Kato, Solange Rossato et al.
Exploring In-Context Learning of Textless Speech Language Model for Speech Classification Tasks
Kai-Wei Chang, Ming-Hao Hsu, Shan-Wen Li et al.
Exploring Multilingual Unseen Speaker Emotion Recognition: Leveraging Co-Attention Cues in Multitask Learning
Arnav Goel, Medha Hira, Anubha Gupta
Exploring Pre-trained Speech Model for Articulatory Feature Extraction in Dysarthric Speech Using ASR
Yuqin Lin, Longbiao Wang, Jianwu Dang et al.
Exploring Self-supervised Embeddings and Synthetic Data Augmentation for Robust Audio Deepfake Detection
Juan M. Martín-Doñas, Aitor Álvarez, Eros Rosello et al.
Exploring Self-Supervised Multi-view Contrastive Learning for Speech Emotion Recognition with Limited Annotations
Bulat Khaertdinov, Pedro Jeruis, Annanda Sousa et al.
Exploring Self-Supervised Speech Representations for Cross-lingual Acoustic-to-Articulatory Inversion
Yun Hao, Reihaneh Amooie, Wietse de Vries et al.
Exploring Sentence Type Effects on the Lombard Effect and Intelligibility Enhancement: A Comparative Study of Natural and Grid Sentences
Hongyang Chen, Yuhong Yang, Zhongyuan Wang et al.
Exploring Speech Foundation Models for Speaker Diarization in Child-Adult Dyadic Interactions
Anfeng Xu, Kevin Huang, Tiantian Feng et al.
Exploring Spoken Language Identification Strategies for Automatic Transcription of Multilingual Broadcast and Institutional Speech
Martina Valente, Fabio Brugnara, Giovanni Morrone et al.
Exploring Syllable Discriminability during Diadochokinetic Task with Increasing Dysarthria Severity for Patients with Amyotrophic Lateral Sclerosis
Neelesh Samptur, Tanuka Bhattacharjee, Anirudh Chakravarty K et al.
Exploring the anatomy of articulation rate in spontaneous English speech: relationships between utterance length effects and social factors
James Tanner, Morgan Sonderegger, Jane Stuart-Smith et al.
Exploring the Benefits of Tokenization of Discrete Acoustic Units
Avihu Dekel, Raul Fernandez
Exploring the Capability of Mamba in Speech Applications
Koichi Miyazaki, Yoshiki Masuyama, Masato Murata
Exploring the Complementary Nature of Speech and Eye Movements for Profiling Neurological Disorders
Yuzhe Wang, Anna Favaro, Thomas Thebaud et al.
Exploring the limits of decoder-only models trained on public speech recognition corpora
Ankit Gupta, George Saon, Brian Kingsbury
Exploring the Robustness of Text-to-Speech Synthesis Based on Diffusion Probabilistic Models to Heavily Noisy Transcriptions
Jingyi Feng, Yusuke Yasuda, Tomoki Toda
Expressive paragraph text-to-speech synthesis with multi-step variational autoencoder
Xuyuan Li, Zengqiang Shang, Peiyang Shi et al.
Extraction of interpretable and shared speaker-specific speech attributes through binary auto-encoder
Imen Ben-Amor, Jean-Francois Bonastre, Salima Mdhaffar