Papers
8,761 papers found
Arabic Dysarthric Speech Recognition Using Adversarial and Signal-Based Augmentation
Massa Baali, Ibrahim Almakky, Shady Shehata et al.
A Relationship Between Vocal Fold Vibration and Droplet Production
Tsukasa Yoshinaga, Takayuki Arai, Akiyoshi Iida
Are retroflex-to-dental sibilant substitutions in Polish children's speech an example of a covert contrast? A preliminary acoustic study
Zuzanna Miodonska, Claartje Levelt, Natalia Mocko et al.
A Simple RNN Model for Lightweight, Low-compute and Low-latency Multichannel Speech Enhancement in the Time Domain
Ashutosh Pandey, Ke Tan, Buye Xu
Asking Questions: an Innovative Way to Interact with Oral History Archives
Jan Švec, Martin Bulín, Adam Frémund et al.
A Snoring Sound Dataset for Body Position Recognition: Collection, Annotation, and Analysis
Li Xiao, Xiuping Yang, Xinhong Li et al.
ASR and Emotional Speech: A Word-Level Investigation of the Mutual Impact of Speech and Emotion Recognition
Yuanchao Li, Zeyu Zhao, Ondřej Klejch et al.
ASR data augmentation in low-resource settings using cross-lingual multi-speaker TTS and cross-lingual voice conversion
Edresson Casanova, Christopher Shulby, Alexander Korolev et al.
ASR for Low Resource and Multilingual Noisy Code-Mixed Speech
Tushar Verma, Atul Shree, Ashutosh Modi
Assessing Intelligibility in Non-native Speech: Comparing Measures Obtained at Different Levels
Xing Wei, Roeland van Hout, Catia Cucchiarini et al.
Assessing Phrase Break of ESL Speech with Pre-trained Language Models and Large Language Models
Zhiyi Wang, Shaoguang Mao, Wenshan Wu et al.
Assessment of Non-Native Speech Intelligibility using Wav2vec2-based Mispronunciation Detection and Multi-level Goodness of Pronunciation Transformer
Ram C. M. C. Shekar, Mu Yang, Kevin Hirschi et al.
A stimulus-organism-response model of willingness to buy from advertising speech using voice quality
Mizuki Nagano, Yusuke Ijima, Sadao Hiroya
A Study on Prosodic Entrainment in Relation to Therapist Empathy in Counseling Conversation
Dehua Tao, Tan Lee, Harold Chui et al.
A Study on the Importance of Formant Transitions for Stop-Consonant Classification in VCV Sequence
Siddarth Chandrasekar, Arvind Ramesh, Tilak Purohit et al.
A Study on Using Duration and Formant Features in Automatic Detection of Speech Sound Disorder in Children
Si-Ioi Ng, Cymie Wing-Yee Ng, Tan Lee
A Study on Visualization of Voiceprint Feature
Jian Zhang, Liang He, Xiaochen Guo et al.
A Stutter Seldom Comes Alone – Cross-Corpus Stuttering Detection as a Multi-label Problem
Sebastian P. Bayerl, Dominik Wagner, Ilja Baumann et al.
A System for Generating Voice Source Signals that Implements the Transformed LF-model Parameter Control
Zihan Wang, Christer Gobl
A Teacher-Student Approach for Extracting Informative Speaker Embeddings From Speech Mixtures
Tobias Cord-Landwehr, Christoph Boeddeker, Cătălin Zorilă et al.
A Training and Inference Strategy Using Noisy and Enhanced Speech as Target for Speech Enhancement without Clean Speech
Li-Wei Chen, Yao-Fei Cheng, Hung-Shin Lee et al.
Attention-based Encoder-Decoder Network for End-to-End Neural Speaker Diarization with Target Speaker Attractor
Zhengyang Chen, Bing Han, Shuai Wang et al.
Attention-based Interactive Disentangling Network for Instance-level Emotional Voice Conversion
Yun Chen, Lingxiao Yang, Qi Chen et al.
Attention Gate Between Capsules in Fully Capsule-Network Speech Recognition
Kyungmin Lee, Hyeontaek Lim, Mun-Hwan Lee et al.