Papers
A Robust and Cascaded Acoustic Echo Cancellation Based on Deep Learning
Chenggang Zhang, Xueliang Zhang
ARVC: An Auto-Regressive Voice Conversion System Without Parallel Training Data
Zheng Lian, Zhengqi Wen, Xinyong Zhou et al.
ASAPP-ASR: Multistream CNN and Self-Attentive SRU for SOTA Speech Recognition
Jing Pan, Joshua Shapiro, Jeremy Wohlwend et al.
A Semi-Blind Source Separation Approach for Speech Dereverberation
Ziteng Wang, Yueyue Na, Zhang Liu et al.
A Sound Engineering Approach to Near End Listening Enhancement
Carol Chermaz, Simon King
A Space-and-Speaker-Aware Iterative Mask Estimation Approach to Multi-Channel Speech Recognition in the CHiME-6 Challenge
Yan-Hui Tu, Jun Du, Lei Sun et al.
ASR-Based Evaluation and Feedback for Individualized Reading Practice
Yu Bai, Ferdy Hubers, Catia Cucchiarini et al.
ASR Error Correction with Augmented Transformer for Entity Retrieval
Haoyu Wang, Shuyan Dong, Yue Liu et al.
ASR-Free Pronunciation Assessment
Sitong Cheng, Zhixin Liu, Lantian Li et al.
Assessment of Parkinson’s Disease Medication State Through Automatic Speech Analysis
Anna Pompili, Rubén Solera-Ureña, Alberto Abad et al.
Asteroid: The PyTorch-Based Audio Source Separation Toolkit for Researchers
Manuel Pariente, Samuele Cornell, Joris Cosentino et al.
ATCSpeech: A Multilingual Pilot-Controller Speech Corpus from Real Air Traffic Control Environment
Bo Yang, Xianlong Tan, Zhengmao Chen et al.
A Transformer-Based Audio Captioning Model with Keyword Estimation
Yuma Koizumi, Ryo Masumura, Kyosuke Nishida et al.
ATReSN-Net: Capturing Attentive Temporal Relations in Semantic Neighborhood for Acoustic Scene Classification
Liwen Zhang, Jiqing Han, Ziqiang Shi
Atss-Net: Target Speaker Separation via Attention-Based Neural Network
Tingle Li, Qingjian Lin, Yuanyuan Bao et al.
Attention and Encoder-Decoder Based Models for Transforming Articulatory Movements at Different Speaking Rates
Abhayjeet Singh, Aravind Illa, Prasanta Kumar Ghosh
Attention-Based Speaker Embeddings for One-Shot Voice Conversion
Tatsuma Ishihara, Daisuke Saito
Attention-Driven Projections for Soundscape Classification
Dhanunjaya Varma Devalraju, Muralikrishna H., Padmanabhan Rajan et al.
Attention Forcing for Speech Synthesis
Qingyun Dou, Joshua Efiong, Mark J.F. Gales
Attention to Indexical Information Improves Voice Recall
Grant L. McGuire, Molly Babel
Attention Wave-U-Net for Acoustic Echo Cancellation
Jung-Hee Kim, Joon-Hyuk Chang
Attentive Convolutional Recurrent Neural Network Using Phoneme-Level Acoustic Representation for Rare Sound Event Detection
Shreya G. Upadhyay, Bo-Hao Su, Chi-Chun Lee
Attentron: Few-Shot Text-to-Speech Utilizing Attention-Based Variable-Length Embedding
Seungwoo Choi, Seungju Han, Dongyoung Kim et al.
Audio Dequantization for High Fidelity Audio Generation in Flow-Based Neural Vocoder
Hyun-Wook Yoon, Sang-Hoon Lee, Hyeong-Rae Noh et al.
Audiovisual Correspondence Learning in Humans and Machines
Venkat Krishnamohan, Akshara Soman, Anshul Gupta et al.