Papers
Detecting Audio Attacks on ASR Systems with Dropout Uncertainty
Tejas Jayashankar, Jonathan Le Roux, Pierre Moulin
Detection of Subclinical Mild Traumatic Brain Injury (mTBI) Through Speech and Gait
Tanya Talkar, Sophia Yuditskaya, James R. Williamson et al.
Detection of Voicing and Place of Articulation of Fricatives with Deep Learning in a Virtual Speech and Language Therapy Tutor
Ivo Anjos, Maxine Eskenazi, Nuno Marques et al.
Developing an Open-Source Corpus of Yoruba Speech
Alexander Gutkin, Işın Demirşahin, Oddur Kjartansson et al.
Developing RNN-T Models Surpassing High-Performance Hybrid Models with Customization Capability
Jinyu Li, Rui Zhao, Zhong Meng et al.
Development of a Speech Quality Database Under Uncontrolled Conditions
Alessandro Ragano, Emmanouil Benetos, Andrew Hines
Development of Multilingual ASR Using GlobalPhone for Less-Resourced Languages: The Case of Ethiopian Languages
Martha Yifiru Tachbelie, Solomon Teferra Abate, Tanja Schultz
Differences in Gradient Emotion Perception: Human vs. Alexa Voices
Michelle Cohn, Eran Raveh, Kristin Predeck et al.
Differential Beamforming for Uniform Circular Array with Directional Microphones
Weilong Huang, Jinwei Feng
Dimensional Emotion Prediction Based on Interactive Context in Conversation
Xiaohan Shi, Sixia Li, Jianwu Dang
DiPCo — Dinner Party Corpus
Maarten Van Segbroeck, Ahmed Zaid, Ksenia Kutsenko et al.
Discovering Articulatory Speech Targets from Synthesized Random Babble
Heikki Rasilo, Yannick Jadoul
Discriminative Method to Extract Coarse Prosodic Structure and its Application for Statistical Phrase/Accent Command Estimation
Yuma Shirahata, Daisuke Saito, Nobuaki Minematsu
Discriminative Singular Spectrum Analysis for Bioacoustic Classification
Bernardo B. Gatto, Eulanda M. dos Santos, Juan G. Colonna et al.
Discriminative Transfer Learning for Optimizing ASR and Semantic Labeling in Task-Oriented Spoken Dialog
Yao Qian, Yu Shi, Michael Zeng
Disfluencies and Fine-Tuning Pre-Trained Language Models for Detection of Alzheimer’s Disease
Jiahong Yuan, Yuchen Bian, Xingyu Cai et al.
Distant Supervision for Polyphone Disambiguation in Mandarin Chinese
Jiawen Zhang, Yuanyuan Zhao, Jiaqi Zhu et al.
Distilling the Knowledge of BERT for Sequence-to-Sequence ASR
Hayato Futami, Hirofumi Inaguma, Sei Ueno et al.
Distributed Summation Privacy for Speech Enhancement
Matt O’Connor, W. Bastiaan Kleijn
DNN No-Reference PSTN Speech Quality Prediction
Gabriel Mittag, Ross Cutler, Yasaman Hosseinkashi et al.
Do End-to-End Speech Recognition Models Care About Context?
Lasse Borgholt, Jakob D. Havtorn, Željko Agić et al.
Does French Listeners’ Ability to Use Accentual Information at the Word Level Depend on the Ear of Presentation?
Amandine Michelas, Sophie Dufour
Does Lexical Retrieval Deteriorate in Patients with Mild Cognitive Impairment? Analysis of Brain Functional Network Will Tell
Chongyuan Lian, Tianqi Wang, Mingxiao Gu et al.
Do Face Masks Introduce Bias in Speech Technologies? The Case of Automated Scoring of Speaking Proficiency
Anastassia Loukina, Keelan Evanini, Matthew Mulholland et al.