Research Explorer

Exploiting Foundation Models and Speech Enhancement for Parkinson's Disease Detection from Speech in Real-World Operative Conditions

Moreno La Quatra, Maria Francesca Turco, Torbjørn Svendsen et al.

2024 INTERSPEECH

Exploiting Wavelet Scattering Transform for an Unsupervised Speaker Diarization in Deep Neural Network Framework

Arunav Arya, Murtiza Ali, Karan Nathwani

2024 INTERSPEECH

Exploring adaptation techniques of large speech foundation models for low-resource ASR: a case study on Northern Sámi

Yaroslav Getman, Tamas Grosz, Katri Hiovain-Asikainen et al.

2024 INTERSPEECH

Exploring compressibility of transformer based text-to-music (TTM) models

Vasileios Moschopoulos, Thanasis Kotsiopoulos, Pablo Peso Parada et al.

2024 INTERSPEECH

Exploring Energy-Based Models for Out-of-Distribution Detection in Dialect Identification

Yaqian Hao, Chenguang Hu, Yingying Gao et al.

2024 INTERSPEECH

Exploring Gender-Specific Speech Patterns in Automatic Suicide Risk Assessment

Maurice Gerczuk, Shahin Amiriparian, Justina Lutz et al.

2024 INTERSPEECH

Exploring Impact of Pausing and Lexical Stress Patterns on L2 English Comprehensibility in Real Time

Sylvain Coulange, Tsuneo Kato, Solange Rossato et al.

2024 INTERSPEECH

Exploring In-Context Learning of Textless Speech Language Model for Speech Classification Tasks

Kai-Wei Chang, Ming-Hao Hsu, Shan-Wen Li et al.

2024 INTERSPEECH

Exploring Multilingual Unseen Speaker Emotion Recognition: Leveraging Co-Attention Cues in Multitask Learning

Arnav Goel, Medha Hira, Anubha Gupta

2024 INTERSPEECH

Exploring Pre-trained Speech Model for Articulatory Feature Extraction in Dysarthric Speech Using ASR

Yuqin Lin, Longbiao Wang, Jianwu Dang et al.

2024 INTERSPEECH

Exploring Self-supervised Embeddings and Synthetic Data Augmentation for Robust Audio Deepfake Detection

Juan M. Martín-Doñas, Aitor Álvarez, Eros Rosello et al.

2024 INTERSPEECH

Exploring Self-Supervised Multi-view Contrastive Learning for Speech Emotion Recognition with Limited Annotations

Bulat Khaertdinov, Pedro Jeruis, Annanda Sousa et al.

2024 INTERSPEECH

Exploring Self-Supervised Speech Representations for Cross-lingual Acoustic-to-Articulatory Inversion

Yun Hao, Reihaneh Amooie, Wietse de Vries et al.

2024 INTERSPEECH

Exploring Sentence Type Effects on the Lombard Effect and Intelligibility Enhancement: A Comparative Study of Natural and Grid Sentences

Hongyang Chen, Yuhong Yang, Zhongyuan Wang et al.

2024 INTERSPEECH

Exploring Speech Foundation Models for Speaker Diarization in Child-Adult Dyadic Interactions

Anfeng Xu, Kevin Huang, Tiantian Feng et al.

2024 INTERSPEECH

Exploring Spoken Language Identification Strategies for Automatic Transcription of Multilingual Broadcast and Institutional Speech

Martina Valente, Fabio Brugnara, Giovanni Morrone et al.

2024 INTERSPEECH

Exploring Syllable Discriminability during Diadochokinetic Task with Increasing Dysarthria Severity for Patients with Amyotrophic Lateral Sclerosis

Neelesh Samptur, Tanuka Bhattacharjee, Anirudh Chakravarty K et al.

2024 INTERSPEECH

Exploring the anatomy of articulation rate in spontaneous English speech: relationships between utterance length effects and social factors

James Tanner, Morgan Sonderegger, Jane Stuart-Smith et al.

2024 INTERSPEECH

Exploring the Benefits of Tokenization of Discrete Acoustic Units

Avihu Dekel, Raul Fernandez

2024 INTERSPEECH

Exploring the Capability of Mamba in Speech Applications

Koichi Miyazaki, Yoshiki Masuyama, Masato Murata

2024 INTERSPEECH

Exploring the Complementary Nature of Speech and Eye Movements for Profiling Neurological Disorders

Yuzhe Wang, Anna Favaro, Thomas Thebaud et al.

2024 INTERSPEECH

Exploring the limits of decoder-only models trained on public speech recognition corpora

Ankit Gupta, George Saon, Brian Kingsbury

2024 INTERSPEECH

Exploring the Robustness of Text-to-Speech Synthesis Based on Diffusion Probabilistic Models to Heavily Noisy Transcriptions

Jingyi Feng, Yusuke Yasuda, Tomoki Toda

2024 INTERSPEECH

Expressive paragraph text-to-speech synthesis with multi-step variational autoencoder

Xuyuan Li, Zengqiang Shang, Peiyang Shi et al.

2024 INTERSPEECH

Extraction of interpretable and shared speaker-specific speech attributes through binary auto-encoder

Imen Ben-Amor, Jean-Francois Bonastre, Salima Mdhaffar

2024 INTERSPEECH
