Research Explorer

Towards a Quantitative Analysis of Coarticulation with a Phoneme-to-Articulatory Model

Chaofei Fan, Jaimie M. Henderson, Chris Manning et al.

2024 INTERSPEECH

Towards Audio Codec-based Speech Separation

Jia Qi Yip, Shengkui Zhao, Dianwen Ng et al.

2024 INTERSPEECH

Towards Classifying Mother Tongue from Infant Cries - Findings Substantiating Prenatal Learning Theory

Tim Polzehl, Tim Herzig, Friedrich Wicke et al.

2024 INTERSPEECH

Towards Effective and Efficient Non-autoregressive Decoding Using Block-based Attention Mask

Tianzi Wang, Xurong Xie, Zhaoqing Li et al.

2024 INTERSPEECH

Towards EMG-to-Speech with Necklace Form Factor

Peter Wu, Ryan Kaveh, Raghav Nautiyal et al.

2024 INTERSPEECH

Towards End-to-End Unified Recognition for Mandarin and Cantonese

Meiling Chen, Pengjie Liu, Heng Yang et al.

2024 INTERSPEECH

Towards Explainable Monaural Speaker Separation with Auditory-based Training

Hassan Taherian, Vahid Ahmadi Kalkhorani, Ashutosh Pandey et al.

2024 INTERSPEECH

Towards Expressive Zero-Shot Speech Synthesis with Hierarchical Prosody Modeling

Yuepeng Jiang, Tao Li, Fengyu Yang et al.

2024 INTERSPEECH

Towards generalisable and calibrated audio deepfake detection with self-supervised representations

Octavian Pascu, Adriana Stan, Dan Oneata et al.

2024 INTERSPEECH

Towards Improving NAM-to-Speech Synthesis Intelligibility using Self-Supervised Speech Models

Neil Shah, Shirish Karande, Vineet Gandhi

2024 INTERSPEECH

Towards Intelligent Speech Assistants in Operating Rooms: A Multimodal Model for Surgical Workflow Analysis

Kubilay Can Demir, Belén Lojo Rodríguez, Tobias Weise et al.

2024 INTERSPEECH

Towards interfacing large language models with ASR systems using confidence measures and prompting

Maryam Naderi, Enno Hermann, Alexandre Nanchen et al.

2024 INTERSPEECH

Towards measuring fairness in speech recognition: Fair-Speech dataset

Irina-Elena Veliche, Zhuangqun Huang, Vineeth Ayyat Kochaniyan et al.

2024 INTERSPEECH

Towards Multilingual Audio-Visual Question Answering

Orchid Chetia Phukan, Priyabrata Mallick, Swarup Ranjan Behera et al.

2024 INTERSPEECH

Towards Naturalistic Voice Conversion: NaturalVoices Dataset with an Automatic Processing Pipeline

Ali N. Salman, Zongyang Du, Shreeram Suresh Chandra et al.

2024 INTERSPEECH

Towards objective and interpretable speech disorder assessment: a comparative analysis of CNN and transformer-based models

Malo Maisonneuve, Corinne Fredouille, Muriel Lalain et al.

2024 INTERSPEECH

Towards Realistic Emotional Voice Conversion using Controllable Emotional Intensity

Tianhua Qi, Shiyan Wang, Cheng Lu et al.

2024 INTERSPEECH

Towards realtime co-speech gestures synthesis using STARGATE

Louis Abel, Vincent Colotte, Slim Ouni

2024 INTERSPEECH

Towards Rehearsal-Free Multilingual ASR: A LoRA-based Case Study on Whisper

Tianyi Xu, Kaixun Huang, Pengcheng Guo et al.

2024 INTERSPEECH

Towards Responsible Speech Processing

Isabel Trancoso

2024 INTERSPEECH

Towards Robust Few-shot Class Incremental Learning in Audio Classification using Contrastive Representation

Riyansha Singh, Parinita Nema, Vinod K Kurmi

2024 INTERSPEECH

Towards Scalable Remote Assessment of Mild Cognitive Impairment Via Multimodal Dialog

Oliver Roesler, Jackson Liscombe, Michael Neumann et al.

2024 INTERSPEECH

Towards Self-Attention Understanding for Automatic Articulatory Processes Analysis in Cleft Lip and Palate Speech

Ilja Baumann, Dominik Wagner, Maria Schuster et al.

2024 INTERSPEECH

Towards Speech Classification from Acoustic and Vocal Tract data in Real-time MRI

Yaoyao Yue, Michael Proctor, Luping Zhou et al.

2024 INTERSPEECH

Towards Speech-to-Pictograms Translation

Cécile Macaire, Chloé Dion, Didier Schwab et al.

2024 INTERSPEECH

Papers