Papers
Automatic Speech Recognition for ILSE-Interviews: Longitudinal Conversational Speech Recordings Covering Aging and Cognitive Decline
Ayimunishagu Abulimiti, Jochen Weiner, Tanja Schultz
Autosegmental Neural Nets: Should Phones and Tones be Synchronous or Asynchronous?
Jialu Li, Mark Hasegawa-Johnson
AutoSpeech 2020: The Second Automated Machine Learning Challenge for Speech Classification
Jingsong Wang, Tom Ko, Zhen Xu et al.
AutoSpeech: Neural Architecture Search for Speaker Recognition
Shaojin Ding, Tianlong Chen, Xinyu Gong et al.
Bandpass Noise Generation and Augmentation for Unified ASR
Kshitiz Kumar, Bo Ren, Yifan Gong et al.
Bidirectional LSTM Network with Ordered Neurons for Speech Enhancement
Xiaoqi Li, Yaxing Li, Yuanjie Dong et al.
Bi-Encoder Transformer Network for Mandarin-English Code-Switching Speech Recognition Using Mixture of Experts
Yizhou Lu, Mingkun Huang, Hao Li et al.
Bi-Level Speaker Supervision for One-Shot Speech Synthesis
Tao Wang, Jianhua Tao, Ruibo Fu et al.
Bilingual Acoustic Voice Variation is Similarly Structured Across Languages
Khia A. Johnson, Molly Babel, Robert A. Fuhrman
BlaBla: Linguistic Feature Extraction for Clinical Analysis in Multiple Languages
Abhishek Shivkumar, Jack Weston, Raphael Lenain et al.
Black-Box Adaptation of ASR for Accented Speech
Kartik Khandelwal, Preethi Jyothi, Abhijeet Awasthi et al.
Black-Box Attacks on Spoofing Countermeasures Using Transferability of Adversarial Examples
Yuekai Zhang, Ziyan Jiang, Jesús Villalba et al.
Blind Speech Signal Quality Estimation for Speaker Verification Systems
Galina Lavrentyeva, Marina Volkova, Anastasia Avdeeva et al.
BLSTM-Driven Stream Fusion for Automatic Speech Recognition: Novel Methods and a Multi-Size Window Fusion Example
Timo Lohrenz, Tim Fingscheidt
Brain networks enabling speech perception in everyday settings
Barbara Shinn-Cunningham
Building a Robust Word-Level Wakeword Verification Network
Rajath Kumar, Mike Rodehorst, Joe Wang et al.
Bunched LPCNet: Vocoder for Low-Cost Neural Text-To-Speech Systems
Ravichander Vipperla, Sangjun Park, Kihyun Choo et al.
BUT Text-Dependent Speaker Verification System for SdSV Challenge 2020
Alicia Lozano-Diez, Anna Silnova, Bhargav Pulugundla et al.
CAM: Uninteresting Speech Detector
Weiyi Lu, Yi Xu, Peng Yang et al.
Can Auditory Nerve Models Tell us What’s Different About WaveNet Vocoded Speech?
Sébastien Le Maguer, Naomi Harte
Can Speaker Augmentation Improve Multi-Speaker End-to-End TTS?
Erica Cooper, Cheng-I Lai, Yusuke Yasuda et al.
Caption Alignment for Low Resource Audio-Visual Data
Vighnesh Reddy Konda, Mayur Warialani, Rakesh Prasanth Achari et al.
CAT: A CTC-CRF Based ASR Toolkit Bridging the Hybrid and the End-to-End Approaches Towards Data Efficiency and Low Latency
Keyu An, Hongyu Xiang, Zhijian Ou
Categorization of Whistled Consonants by French Speakers
Anaïs Tran Ngoc, Julien Meyer, Fanny Meunier
CATOTRON — A Neural Text-to-Speech System in Catalan
Baybars Külebi, Alp Öktem, Alex Peiró-Lilja et al.