Papers
Bidirectional Multiscale Feature Aggregation for Speaker Verification
Jiajun Qi, Wu Guo, Bin Gu
Binary Neural Network for Speaker Verification
Tinglong Zhu, Xiaoyi Qin, Ming Li
Binaural Speech Separation of Moving Speakers With Preserved Spatial Cues
Cong Han, Yi Luo, Nima Mesgarani
Boosting of Contextual Information in ASR for Air-Traffic Call-Sign Recognition
Martin Kocour, Karel Veselý, Alexander Blatt et al.
Bootstrap an End-to-End ASR System by Multilingual Training, Transfer Learning, Text-to-Text Mapping and Synthetic Audio
Manuel Giollo, Deniz Gunceler, Yulan Liu et al.
Bridging the Gap Between Streaming and Non-Streaming ASR Systems by Distilling Ensembles of CTC and RNN-T Models
Thibault Doutre, Wei Han, Chung-Cheng Chiu et al.
Broadcasted Residual Learning for Efficient Keyword Spotting
Byeonggeun Kim, Simyung Chang, Jinkyu Lee et al.
Cancellation of Local Competing Speaker with Near-Field Localization for Distributed ad-hoc Sensor Network
Pablo Pérez Zarazaga, Mariem Bouafif Mansali, Tom Bäckström et al.
Cascaded Multilingual Audio-Visual Learning from Videos
Andrew Rouditchenko, Angie Boggust, David Harwath et al.
Causal Confusion Reduction for Robust Multi-Domain Dialogue Policy
Mahdin Rohmatillah, Jen-Tzung Chien
Changes in Glottal Source Parameter Values with Light to Moderate Physical Load
Heather Weston, Laura L. Koenig, Susanne Fuchs
Channel-Wise Gated Res2Net: Towards Robust Detection of Synthetic Speech Attacks
Xu Li, Xixin Wu, Hui Lu et al.
Characterizing Voiced and Voiceless Nasals in Mizo
Wendy Lalhminghlui, Priyankoo Sarmah
Child Language Acquisition Studied with Wearables
Alejandrina Cristia
Chronological Self-Training for Real-Time Speaker Diarization
Dirk Padfield, Daniel J. Liebling
CLAC: A Speech Corpus of Healthy English Speakers
R’mani Haulcy, James Glass
Clarity-2021 Challenges: Machine Learning Challenges for Advancing Hearing Aid Processing
Simone Graetzer, Jon Barker, Trevor J. Cox et al.
Class-Based Neural Network Language Model for Second-Pass Rescoring in ASR
Lingfeng Dai, Qi Liu, Kai Yu
Classification of COVID-19 from Cough Using Autoregressive Predictive Coding Pretraining and Spectral Data Augmentation
John Harvill, Yash R. Wani, Mark Hasegawa-Johnson et al.
CNN-Based Processing of Acoustic and Radio Frequency Signals for Speaker Localization from MAVs
Andrea Toma, Daniele Salvati, Carlo Drioli et al.
Coded Speech Enhancement Using Neural Network-Based Vector-Quantized Residual Features
Youngju Cheon, Soojoong Hwang, Sangwook Han et al.
CoDERT: Distilling Encoder Representations with Co-Learning for Transducer-Based Speech Recognition
Rupak Vignesh Swaminathan, Brian King, Grant P. Strimel et al.
Collaborative Training of Acoustic Encoders for Speech Recognition
Varun Nagaraja, Yangyang Shi, Ganesh Venkatesh et al.
Combating Reverberation in NTF-Based Speech Separation Using a Sub-Source Weighted Multichannel Wiener Filter and Linear Prediction
Mieszko Fraś, Marcin Witkowski, Konrad Kowalczyk
Combining Hybrid and End-to-End Approaches for the OpenASR20 Challenge
Tanel Alumäe, Jiaming Kong