Papers
Automatic Classification of Phonation Types in Spontaneous Speech: Towards a New Workflow for the Characterization of Speakers’ Voice Quality
Anaïs Chanclu, Imen Ben Amor, Cédric Gendrot et al.
Automatic Detection and Assessment of Alzheimer Disease Using Speech and Language Technologies in Low-Resource Scenarios
Raghavendra Pappagari, Jaejin Cho, Sonal Joshi et al.
Automatic Detection of Alzheimer’s Disease Using Spontaneous Speech Only
Jun Chen, Jieping Ye, Fengyi Tang et al.
Automatic Detection of Shouted Speech Segments in Indian News Debates
Shikha Baghel, Mrinmoy Bhattacharjee, S.R. Mahadeva Prasanna et al.
Automatic Error Correction for Speaker Embedding Learning with Noisy Labels
Fuchuan Tong, Yan Liu, Song Li et al.
Automatic Extraction of Speech Rhythm Descriptors for Speech Intelligibility Assessment in the Context of Head and Neck Cancers
Robin Vaysse, Jérôme Farinas, Corine Astésano et al.
Automatic Lip-Reading with Hierarchical Pyramidal Convolution and Self-Attention for Image Sequences with No Word Boundaries
Hang Chen, Jun Du, Yu Hu et al.
Automatic Radiology Report Editing Through Voice
Manh Hung Nguyen, Vu Hoang, Tu Anh Nguyen et al.
Automatic Severity Classification of Korean Dysarthric Speech Using Phoneme-Level Pronunciation Features
Eun Jung Yeo, Sunhee Kim, Minhwa Chung
Automatic Speech Recognition of Disordered Speech: Personalized Models Outperforming Human Listeners on Short Phrases
Jordan R. Green, Robert L. MacDonald, Pan-Pan Jiang et al.
Automatic Speech Recognition Systems Errors for Objective Sleepiness Detection Through Voice
Vincent P. Martin, Jean-Luc Rouas, Florian Boyer et al.
Autonomous Robot for Measuring Room Impulse Responses
Stefan Fragner, Tobias Topar, Maximilian Giller et al.
Auxiliary Loss Function for Target Speech Extraction and Recognition with Weak Supervision Based on Speaker Characteristics
Katerina Zmolikova, Marc Delcroix, Desh Raj et al.
Auxiliary Sequence Labeling Tasks for Disfluency Detection
Dongyub Lee, Byeongil Ko, Myeong Cheol Shin et al.
AvaTr: One-Shot Speaker Extraction with Transformers
Shell Xu Hu, Md. Rifat Arefin, Viet-Nhat Nguyen et al.
AVLnet: Learning Audio-Visual Language Representations from Instructional Videos
Andrew Rouditchenko, Angie Boggust, David Harwath et al.
A Voice-Activated Switch for Persons with Motor and Speech Impairments: Isolated-Vowel Spotting Using Neural Networks
Shanqing Cai, Lisie Lillianfeld, Katie Seaver et al.
A Weight Moving Average Based Alternate Decoupled Learning Algorithm for Long-Tailed Language Identification
Hui Wang, Lin Liu, Yan Song et al.
BART Based Semantic Correction for Mandarin Automatic Speech Recognition System
Yun Zhao, Xuerui Yang, Jinchao Wang et al.
Basis-MelGAN: Efficient Neural Vocoder Based on Audio Decomposition
Zhengxi Liu, Yanmin Qian
Bayesian Parametric and Architectural Domain Adaptation of LF-MMI Trained TDNNs for Elderly and Dysarthric Speech Recognition
Jiajun Deng, Fabian Ritter Gutierrez, Shoukang Hu et al.
Beey: More Than a Speech-to-Text Editor
Lenka Weingartová, Veronika Volná, Ewa Balejová
BERT-Based Semantic Model for Rescoring N-Best Speech Recognition List
Dominique Fohr, Irina Illina
Best of Both Worlds: Robust Accented Speech Recognition with Adversarial Transfer Learning
Nilaksh Das, Sravan Bodapati, Monica Sunkara et al.
Bi-Directional Joint Neural Networks for Intent Classification and Slot Filling
Soyeon Caren Han, Siqu Long, Huichun Li et al.