Papers
Zero-Shot Foreign Accent Conversion without a Native Reference
Waris Quamer, Anurag Das, John Levis et al.
Zero-Shot Voice Conditioning for Denoising Diffusion TTS Models
Alon Levkovitch, Eliya Nachmani, Lior Wolf
4-Bit Quantization of LSTM-Based Speech Recognition Models
Andrea Fasoli, Chia-Yu Chen, Mauricio Serrano et al.
A Benchmark of Dynamical Variational Autoencoders Applied to Speech Spectrogram Modeling
Xiaoyu Bie, Laurent Girin, Simon Leglaive et al.
A Causal U-Net Based Neural Beamforming Network for Real-Time Multi-Channel Speech Enhancement
Xinlei Ren, Xu Zhang, Lianwu Chen et al.
A Comparative Study of Different EMG Features for Acoustics-to-EMG Mapping
Manthan Sharma, Navaneetha Gaddam, Tejas Umesh et al.
A Comparative Study on Neural Architectures and Training Methods for Japanese Speech Recognition
Shigeki Karita, Yotaro Kubo, Michiel Adriaan Unico Bacchiani et al.
A Comparative Study on Recent Neural Spoofing Countermeasures for Synthetic Speech Detection
Xin Wang, Junichi Yamagishi
A Comparison of Acoustic Correlates of Voice Quality Across Different Recording Devices: A Cautionary Tale
Joshua Penney, Andy Gibson, Felicity Cox et al.
A Comparison of Supervised and Unsupervised Pre-Training of End-to-End Models
Ananya Misra, Dongseong Hwang, Zhouyuan Huo et al.
A Comparison of the Accuracy of Dissen and Keshet’s (2016) DeepFormants and Traditional LPC Methods for Semi-Automatic Speaker Recognition
Thomas Coy, Vincent Hughes, Philip Harrison et al.
A Context-Aware Hierarchical BERT Fusion Network for Multi-Turn Dialog Act Detection
Ting-Wei Wu, Ruolin Su, Biing-Hwang Juang
Acoustic and Prosodic Correlates of Emotions in Urdu Speech
Saba Urooj, Benazir Mumtaz, Sarmad Hussain et al.
Acoustic Data-Driven Subword Modeling for End-to-End Speech Recognition
Wei Zhou, Mohammad Zeineldeen, Zuoyun Zheng et al.
Acoustic Echo Cancellation Using Deep Complex Neural Network with Nonlinear Magnitude Compression and Phase Information
Renhua Peng, Linjuan Cheng, Chengshi Zheng et al.
Acoustic Echo Cancellation with Cross-Domain Learning
Lukas Pfeifenberger, Matthias Zoehrer, Franz Pernkopf
Acoustic Event Detection with Classifier Chains
Tatsuya Komatsu, Shinji Watanabe, Koichi Miyazaki et al.
Acoustic Features and Neural Representations for Categorical Emotion Recognition from Speech
Aaron Keesing, Yun Sing Koh, Michael Witbrock
Acoustic Indicators of Speech Motor Coordination in Adults With and Without Traumatic Brain Injury
Tanya Talkar, Nancy Pearl Solomon, Douglas S. Brungart et al.
Acoustic-Prosodic, Lexical and Demographic Cues to Persuasiveness in Competitive Debate Speeches
Huyen Nguyen, Ralph Vente, David Lupea et al.
Acoustic Scene Classification Using Kervolution-Based SubSpectralNet
Ritika Nandi, Shashank Shekhar, Manjunath Mulimani
Acquisition of Prosodic Focus Marking by Three- to Six-Year-Old Children Learning Mandarin Chinese
Qianyutong Zhang, Kexin Lyu, Zening Chen et al.
A Cross-Dialectal Comparison of Apical Vowels in Beijing Mandarin, Northeastern Mandarin and Southwestern Mandarin: An EMA and Ultrasound Study
Jing Huang, Feng-fan Hsieh, Yueh-chin Chang
Act-Aware Slot-Value Predicting in Multi-Domain Dialogue State Tracking
Ruolin Su, Ting-Wei Wu, Biing-Hwang Juang