Papers
A Cross-Dialectal Comparison of Apical Vowels in Beijing Mandarin, Northeastern Mandarin and Southwestern Mandarin: An EMA and Ultrasound Study
Jing Huang, Feng-fan Hsieh, Yueh-chin Chang
Act-Aware Slot-Value Predicting in Multi-Domain Dialogue State Tracking
Ruolin Su, Ting-Wei Wu, Biing-Hwang Juang
Acted vs. Improvised: Domain Adaptation for Elicitation Approaches in Audio-Visual Emotion Recognition
Haoqi Li, Yelin Kim, Cheng-Hao Kuo et al.
Active Speaker Detection as a Multi-Objective Optimization with Uncertainty-Based Multimodal Fusion
Baptiste Pouthier, Laurent Pilati, Leela K. Gudupudi et al.
Adapt-and-Adjust: Overcoming the Long-Tail Problem of Multilingual Speech Recognition
Genta Indra Winata, Guangsen Wang, Caiming Xiong et al.
Adapting Long Context NLM for ASR Rescoring in Conversational Agents
Ashish Shenoy, Sravan Bodapati, Monica Sunkara et al.
Adapting Speaker Embeddings for Speaker Diarisation
Youngki Kwon, Jee-weon Jung, Hee-Soo Heo et al.
Adaptive Convolutional Neural Network for Text-Independent Speaker Recognition
Seong-Hu Kim, Yong-Hwa Park
Adaptive Listening Difficulty Detection for L2 Learners Through Moderating ASR Resources
Maryam Sadat Mirzaei, Kourosh Meshgi
Adaptive Listening to Everyday Soundscapes
Mounya Elhilali
Adaptive Margin Circle Loss for Speaker Verification
Runqiu Xiao, Xiaoxiao Miao, Wenchao Wang et al.
Adaptive Text to Speech for Spontaneous Style
Yuzi Yan, Xu Tan, Bohan Li et al.
Additive Phoneme-Aware Margin Softmax Loss for Language Recognition
Zheng Li, Yan Liu, Lin Li et al.
Addressing Compliance in Call Centers with Entity Extraction
Sai Guruju, Jithendra Vepa
A Deep and Recurrent Architecture for Primate Vocalization Classification
Robert Müller, Steffen Illium, Claudia Linnhoff-Popien
A Deep Learning Approach to Multi-Channel and Multi-Microphone Acoustic Echo Cancellation
Hao Zhang, DeLiang Wang
A Deep Learning Method to Multi-Channel Active Noise Control
Hao Zhang, DeLiang Wang
A Deliberation-Based Joint Acoustic and Text Decoder
Sepand Mavandadi, Tara N. Sainath, Ke Hu et al.
ADEPT: A Dataset for Evaluating Prosody Transfer
Alexandra Torresquintero, Tian Huey Teh, Christopher G.R. Wallis et al.
A Discriminative Entity-Aware Language Model for Virtual Assistants
Mandana Saebi, Ernest Pusateri, Aaksha Meghawat et al.
Adjunct-Emeritus Distillation for Semi-Supervised Language Model Adaptation
Scott Novotney, Yile Gu, Ivan Bulyko
Advanced Long-Context End-to-End Speech Recognition Using Context-Expanded Transformers
Takaaki Hori, Niko Moritz, Chiori Hori et al.
Advanced Semi-Blind Speaker Extraction and Tracking Implemented in Experimental Device with Revolving Dense Microphone Array
J. Čmejla, T. Kounovský, J. Janský et al.
Advances in Integration of End-to-End Neural and Clustering-Based Diarization for Real Conversational Speech
Keisuke Kinoshita, Marc Delcroix, Naohiro Tawara
Adversarial Data Augmentation for Disordered Speech Recognition
Zengrui Jin, Mengzhe Geng, Xurong Xie et al.