Papers
Speaker Verification Across Ages: Investigating Deep Speaker Embedding Sensitivity to Age Mismatch in Enrollment and Test Speech
Vishwanath Pratap Singh, Md Sahidullah, Tomi Kinnunen
Speak & Improve: L2 English Speaking Practice Tool
Diane Nicholls, Kate M. Knill, Mark J. F. Gales et al.
Speaking Clearly, Understanding Better: Predicting the L2 Narrative Comprehension of Chinese Bilingual Kindergarten Children Based on Speech Intelligibility Using a Machine Learning Approach
Hiuching Hung, Paula A. Pérez-Toro, Tomás Arias-Vergara et al.
Speaking State Decoder with Transition Detection for Next Speaker Prediction
Shao-Hao Lu, Yun-Shao Lin, Chi-Chun Lee
Speech Aware Dialog System Technology Challenge (DSTC11)
Hagen Soltau, Izhak Shafran, Mingqiu Wang et al.
Speech-Based Classification of Defensive Communication: A Novel Dataset and Results
Shahin Amiriparian, Lukas Christ, Regina Kushtanova et al.
Speech Breathing Behavior During Pauses in Children
Delphine Charuau, Béatrice Vaxelaire, Rudolph Sock
Speech Emotion Recognition by Estimating Emotional Label Sequences with Phoneme Class Attribute
Ryotaro Nagase, Takahiro Fukumori, Yoichi Yamashita
Speech Emotion Recognition using Decomposed Speech via Multi-task Learning
Jia-Hao Hsu, Chung-Hsien Wu, Yu-Hung Wei
Speech Enhancement Patterns in Human-Robot Interaction: A Cross-Linguistic Perspective
Jacek Kudera, Katharina Zahner-Ritter, Jakob Engel et al.
Speech Entrainment in Chinese Story-Style Talk Shows: The Interaction Between Gender and Role
Yanting Sun, Hongwei Ding
SpeechGLUE: How Well Can Self-Supervised Speech Models Capture Linguistic Knowledge?
Takanori Ashihara, Takafumi Moriya, Kohei Matsuura et al.
Speech inpainting: Context-based speech synthesis guided by video
Juan Felipe Montesinos, Daniel Michelsanti, Gloria Haro et al.
Speech-in-Speech Recognition is Modulated by Familiarity to Dialect
Jessica L. L. Chin, Elena Talevska, Mark Antoniou
Speech Intelligibility Assessment of Dysarthric Speech by using Goodness of Pronunciation with Uncertainty Quantification
Eun Jung Yeo, Kwanghee Choi, Sunhee Kim et al.
Speech reduction: position within French prosodic structure
Kübra Bodur, Roxane Bertrand, James S. German et al.
Speech Self-Supervised Representation Benchmarking: Are We Doing it Right?
Salah Zaiem, Youcef Kemiche, Titouan Parcollet et al.
Speech Synthesis from Articulatory Movements Recorded by Real-time MRI
Yuto Otani, Shun Sawada, Hidefumi Ohmura et al.
Speech Synthesis with Self-Supervisedly Learnt Prosodic Representations
Zhao-Ci Liu, Zhen-Hua Ling, Ya-Jun Hu et al.
Speech Taskonomy: Which Speech Tasks are the most Predictive of fMRI Brain Activity?
Subba Reddy Oota, Veeral Agarwal, Mounika Marreddy et al.
Speech-to-Face Conversion Using Denoising Diffusion Probabilistic Models
Shuhei Kato, Taiichi Hashimoto
SpellMapper: A non-autoregressive neural spellchecker for ASR customization with candidate retrieval based on n-gram mappings
Alexandra Antonova, Evelina Bakhturina, Boris Ginsburg
Spoken Language Identification System for English-Mandarin Code-Switching Child-Directed Speech
Shashi Kant Gupta, Sushant Hiray, Prashant Kukde
Spoofing Attacker Also Benefits from Self-Supervised Pretrained Model
Aoi Ito, Shota Horiguchi