Papers
Introducing the VoicePrivacy Initiative
N. Tomashenko, Brij Mohan Lal Srivastava, Xin Wang et al.
Investigating Effective Additional Contextual Factors in DNN-Based Spontaneous Speech Synthesis
Yuki Yamashita, Tomoki Koriyama, Yuki Saito et al.
Investigating Light-ResNet Architecture for Spoofing Detection Under Mismatched Conditions
Prasanth Parasu, Julien Epps, Kaavya Sriskandaraja et al.
Investigating Robustness of Adversarial Samples Detection for Automatic Speaker Verification
Xu Li, Na Li, Jinghua Zhong et al.
Investigating Self-Supervised Pre-Training for End-to-End Speech Translation
Ha Nguyen, Fethi Bougares, N. Tomashenko et al.
Investigating the Visual Lombard Effect with Gabor Based Features
Waito Chiu, Yan Xu, Andrew Abel et al.
Investigation of Data Augmentation Techniques for Disordered Speech Recognition
Mengzhe Geng, Xurong Xie, Shansong Liu et al.
Investigation of Large-Margin Softmax in Neural Language Modeling
Jingjing Huo, Yingbo Gao, Weiyue Wang et al.
Investigation of NICT Submission for Short-Duration Speaker Verification Challenge 2020
Peng Shen, Xugang Lu, Hisashi Kawai
Investigation of Phase Distortion on Perceived Speech Quality for Hearing-Impaired Listeners
Zhuohuang Zhang, Donald S. Williamson, Yi Shen
Is Everything Fine, Grandma? Acoustic and Linguistic Modeling for Robust Elderly Speech Emotion Recognition
Gizem Soğancıoğlu, Oxana Verkholyak, Heysem Kaya et al.
Iterative Compression of End-to-End ASR Model Using AutoML
Abhinav Mehrotra, Łukasz Dudziak, Jinsu Yeo et al.
Iterative Pseudo-Labeling for Speech Recognition
Qiantong Xu, Tatiana Likhomanenko, Jacob Kahn et al.
JDI-T: Jointly Trained Duration Informed Transformer for Text-To-Speech without Explicit Alignment
Dan Lim, Won Jang, Gyeonghwan O et al.
Joint Detection of Sentence Stress and Phrase Boundary for Prosody
Binghuai Lin, Liyuan Wang, Xiaoli Feng et al.
Jointly Encoding Word Confusion Network and Dialogue Context with BERT for Spoken Language Understanding
Chen Liu, Su Zhu, Zijian Zhao et al.
Jointly Fine-Tuning “BERT-Like” Self Supervised Models to Improve Multimodal Speech Emotion Recognition
Shamane Siriwardhana, Andrew Reis, Rivindu Weerasekera et al.
Joint Prediction of Punctuation and Disfluency in Speech Transcripts
Binghuai Lin, Liyuan Wang
Joint Speaker Counting, Speech Recognition, and Speaker Identification for Overlapped Speech of Any Number of Speakers
Naoyuki Kanda, Yashesh Gaur, Xiaofei Wang et al.
Joint Training for Simultaneous Speech Denoising and Dereverberation with Deep Embedding Representations
Cunhang Fan, Jianhua Tao, Bin Liu et al.
JukeBox: A Multilingual Singer Recognition Dataset
Anurag Chowdhury, Austin Cozzo, Arun Ross
Kaldi-Web: An Installation-Free, On-Device Speech Recognition System
Mathieu Hu, Laurent Pierron, Emmanuel Vincent et al.
Knowledge-and-Data-Driven Amplitude Spectrum Prediction for Hierarchical Neural Vocoders
Yang Ai, Zhen-Hua Ling
Knowledge Distillation from Offline to Streaming RNN Transducer for End-to-End Speech Recognition
Gakuto Kurata, George Saon
LAIX Corpus of Chinese Learner English: Towards a Benchmark for L2 English ASR
Yanhong Wang, Huan Luan, Jiahong Yuan et al.