Research Explorer

Arabic Dysarthric Speech Recognition Using Adversarial and Signal-Based Augmentation

Massa Baali, Ibrahim Almakky, Shady Shehata et al.

2023 INTERSPEECH

A Relationship Between Vocal Fold Vibration and Droplet Production

Tsukasa Yoshinaga, Takayuki Arai, Akiyoshi Iida

2023 INTERSPEECH

Are retroflex-to-dental sibilant substitutions in Polish children's speech an example of a covert contrast? A preliminary acoustic study

Zuzanna Miodonska, Claartje Levelt, Natalia Mocko et al.

2023 INTERSPEECH

A Simple RNN Model for Lightweight, Low-compute and Low-latency Multichannel Speech Enhancement in the Time Domain

Ashutosh Pandey, Ke Tan, Buye Xu

2023 INTERSPEECH

Asking Questions: an Innovative Way to Interact with Oral History Archives

Jan Švec, Martin Bulín, Adam Frémund et al.

2023 INTERSPEECH

A Snoring Sound Dataset for Body Position Recognition: Collection, Annotation, and Analysis

Li Xiao, Xiuping Yang, Xinhong Li et al.

2023 INTERSPEECH

ASR and Emotional Speech: A Word-Level Investigation of the Mutual Impact of Speech and Emotion Recognition

Yuanchao Li, Zeyu Zhao, Ondřej Klejch et al.

2023 INTERSPEECH

ASR data augmentation in low-resource settings using cross-lingual multi-speaker TTS and cross-lingual voice conversion

Edresson Casanova, Christopher Shulby, Alexander Korolev et al.

2023 INTERSPEECH

ASR for Low Resource and Multilingual Noisy Code-Mixed Speech

Tushar Verma, Atul Shree, Ashutosh Modi

2023 INTERSPEECH

Assessing Intelligibility in Non-native Speech: Comparing Measures Obtained at Different Levels

Xing Wei, Roeland van Hout, Catia Cucchiarini et al.

2023 INTERSPEECH

Assessing Phrase Break of ESL Speech with Pre-trained Language Models and Large Language Models

Zhiyi Wang, Shaoguang Mao, Wenshan Wu et al.

2023 INTERSPEECH

Assessment of Non-Native Speech Intelligibility using Wav2vec2-based Mispronunciation Detection and Multi-level Goodness of Pronunciation Transformer

Ram C. M. C. Shekar, Mu Yang, Kevin Hirschi et al.

2023 INTERSPEECH

AsthmaSCELNet: A Lightweight Supervised Contrastive Embedding Learning Framework for Asthma Classification Using Lung Sounds

Arka Roy, Udit Satija

2023 INTERSPEECH

A stimulus-organism-response model of willingness to buy from advertising speech using voice quality

Mizuki Nagano, Yusuke Ijima, Sadao Hiroya

2023 INTERSPEECH

A Study on Prosodic Entrainment in Relation to Therapist Empathy in Counseling Conversation

Dehua Tao, Tan Lee, Harold Chui et al.

2023 INTERSPEECH

A Study on the Importance of Formant Transitions for Stop-Consonant Classification in VCV Sequence

Siddarth Chandrasekar, Arvind Ramesh, Tilak Purohit et al.

2023 INTERSPEECH

A Study on Using Duration and Formant Features in Automatic Detection of Speech Sound Disorder in Children

Si-Ioi Ng, Cymie Wing-Yee Ng, Tan Lee

2023 INTERSPEECH

A Study on Visualization of Voiceprint Feature

Jian Zhang, Liang He, Xiaochen Guo et al.

2023 INTERSPEECH

A Stutter Seldom Comes Alone – Cross-Corpus Stuttering Detection as a Multi-label Problem

Sebastian P. Bayerl, Dominik Wagner, Ilja Baumann et al.

2023 INTERSPEECH

A System for Generating Voice Source Signals that Implements the Transformed LF-model Parameter Control

Zihan Wang, Christer Gobl

2023 INTERSPEECH

A Teacher-Student Approach for Extracting Informative Speaker Embeddings From Speech Mixtures

Tobias Cord-Landwehr, Christoph Boeddeker, Cătălin Zorilă et al.

2023 INTERSPEECH

A Training and Inference Strategy Using Noisy and Enhanced Speech as Target for Speech Enhancement without Clean Speech

Li-Wei Chen, Yao-Fei Cheng, Hung-Shin Lee et al.

2023 INTERSPEECH

Attention-based Encoder-Decoder Network for End-to-End Neural Speaker Diarization with Target Speaker Attractor

Zhengyang Chen, Bing Han, Shuai Wang et al.

2023 INTERSPEECH

Attention-based Interactive Disentangling Network for Instance-level Emotional Voice Conversion

Yun Chen, Lingxiao Yang, Qi Chen et al.

2023 INTERSPEECH

Attention Gate Between Capsules in Fully Capsule-Network Speech Recognition

Kyungmin Lee, Hyeontaek Lim, Mun-Hwan Lee et al.

2023 INTERSPEECH
