Papers
Investigating Robustness of Adversarial Samples Detection for Automatic Speaker Verification
Xu Li, Na Li, Jinghua Zhong et al.
Investigating Self-Supervised Pre-Training for End-to-End Speech Translation
Ha Nguyen, Fethi Bougares, N. Tomashenko et al.
Investigating the Visual Lombard Effect with Gabor Based Features
Waito Chiu, Yan Xu, Andrew Abel et al.
Investigation of Data Augmentation Techniques for Disordered Speech Recognition
Mengzhe Geng, Xurong Xie, Shansong Liu et al.
Investigation of Large-Margin Softmax in Neural Language Modeling
Jingjing Huo, Yingbo Gao, Weiyue Wang et al.
Investigation of NICT Submission for Short-Duration Speaker Verification Challenge 2020
Peng Shen, Xugang Lu, Hisashi Kawai
Investigation of Phase Distortion on Perceived Speech Quality for Hearing-Impaired Listeners
Zhuohuang Zhang, Donald S. Williamson, Yi Shen
Is Everything Fine, Grandma? Acoustic and Linguistic Modeling for Robust Elderly Speech Emotion Recognition
Gizem Soğancıoğlu, Oxana Verkholyak, Heysem Kaya et al.
Iterative Compression of End-to-End ASR Model Using AutoML
Abhinav Mehrotra, Łukasz Dudziak, Jinsu Yeo et al.
Iterative Pseudo-Labeling for Speech Recognition
Qiantong Xu, Tatiana Likhomanenko, Jacob Kahn et al.
JDI-T: Jointly Trained Duration Informed Transformer for Text-To-Speech without Explicit Alignment
Dan Lim, Won Jang, Gyeonghwan O et al.
Joint Detection of Sentence Stress and Phrase Boundary for Prosody
Binghuai Lin, Liyuan Wang, Xiaoli Feng et al.
Jointly Encoding Word Confusion Network and Dialogue Context with BERT for Spoken Language Understanding
Chen Liu, Su Zhu, Zijian Zhao et al.
Jointly Fine-Tuning “BERT-Like” Self Supervised Models to Improve Multimodal Speech Emotion Recognition
Shamane Siriwardhana, Andrew Reis, Rivindu Weerasekera et al.
Joint Prediction of Punctuation and Disfluency in Speech Transcripts
Binghuai Lin, Liyuan Wang
Joint Speaker Counting, Speech Recognition, and Speaker Identification for Overlapped Speech of any Number of Speakers
Naoyuki Kanda, Yashesh Gaur, Xiaofei Wang et al.
Joint Training for Simultaneous Speech Denoising and Dereverberation with Deep Embedding Representations
Cunhang Fan, Jianhua Tao, Bin Liu et al.
JukeBox: A Multilingual Singer Recognition Dataset
Anurag Chowdhury, Austin Cozzo, Arun Ross
Kaldi-Web: An Installation-Free, On-Device Speech Recognition System
Mathieu Hu, Laurent Pierron, Emmanuel Vincent et al.
Knowledge-and-Data-Driven Amplitude Spectrum Prediction for Hierarchical Neural Vocoders
Yang Ai, Zhen-Hua Ling
Knowledge Distillation from Offline to Streaming RNN Transducer for End-to-End Speech Recognition
Gakuto Kurata, George Saon
LAIX Corpus of Chinese Learner English: Towards a Benchmark for L2 English ASR
Yanhong Wang, Huan Luan, Jiahong Yuan et al.
Language Model Data Augmentation Based on Text Domain Transfer
Atsunori Ogawa, Naohiro Tawara, Marc Delcroix
Language Modeling for Speech Analytics in Under-Resourced Languages
Simone Wills, Pieter Uys, Charl van Heerden et al.
Large-Scale End-to-End Multilingual Speech Recognition and Language Identification with Multi-Task Learning
Wenxin Hou, Yue Dong, Bairong Zhuang et al.