Papers
Jasper: An End-to-End Convolutional Neural Acoustic Model
Jason Li, Vitaly Lavrukhin, Boris Ginsburg et al.
Joint Decoding of CTC Based Systems for Speech Recognition
Jiaqi Guo, Yongbin You, Yanmin Qian et al.
Joint Grapheme and Phoneme Embeddings for Contextual End-to-End ASR
Zhehuai Chen, Mahaveer Jain, Yongqiang Wang et al.
Jointly Adversarial Enhancement Training for Robust End-to-End Speech Recognition
Bin Liu, Shuai Nie, Shan Liang et al.
Jointly Trained Conversion Model and WaveNet Vocoder for Non-Parallel Voice Conversion Using Mel-Spectrograms and Phonetic Posteriorgrams
Songxiang Liu, Yuewen Cao, Xixin Wu et al.
Joint Maximization Decoder with Neural Converters for Fully Neural Network-Based Japanese Speech Recognition
Takafumi Moriya, Jian Wang, Tomohiro Tanaka et al.
Joint Optimization of Neural Acoustic Beamforming and Dereverberation with x-Vectors for Robust Speaker Verification
Joon-Young Yang, Joon-Hyuk Chang
Joint Speech Recognition and Speaker Diarization via Sequence Transduction
Laurent El Shafey, Hagen Soltau, Izhak Shafran
Joint Student-Teacher Learning for Audio-Visual Scene-Aware Dialog
Chiori Hori, Anoop Cherian, Tim K. Marks et al.
Joint Training Framework for Text-to-Speech and Voice Conversion Using Multi-Source Tacotron and WaveNet
Mingyang Zhang, Xin Wang, Fuming Fang et al.
Kernel Machines Beat Deep Neural Networks on Mask-Based Single-Channel Speech Enhancement
Like Hui, Siyuan Ma, Mikhail Belkin
Keyword Spotting for Hearing Assistive Devices Robust to External Speakers
Iván López-Espejo, Zheng-Hua Tan, Jesper Jensen
Kite: Automatic Speech Recognition for Unmanned Aerial Vehicles
Dan Oneață, Horia Cucu
KL-Divergence Regularized Deep Neural Network Adaptation for Low-Resource Speaker-Dependent Speech Enhancement
Li Chai, Jun Du, Chin-Hui Lee
Knowledge-Based Linguistic Encoding for End-to-End Mandarin Text-to-Speech Synthesis
Jingbei Li, Zhiyong Wu, Runnan Li et al.
Knowledge Distillation for End-to-End Monaural Multi-Talker ASR System
Wangyou Zhang, Xuankai Chang, Yanmin Qian
Knowledge Distillation for Throat Microphone Speech Recognition
Takahito Suzuki, Jun Ogata, Takashi Tsunakawa et al.
L2 Pronunciation Accuracy and Context: A Pilot Study on the Realization of Geminates in Italian as L2 by French Learners
Sonia d’Apolito, Barbara Gili Fivela
Label Driven Time-Frequency Masking for Robust Continuous Speech Recognition
Meet Soni, Ashish Panda
Language Learning Using Speech to Image Retrieval
Danny Merkx, Stefan L. Frank, Mirjam Ernestus
Language Modeling with Deep Transformers
Kazuki Irie, Albert Zeyer, Ralf Schlüter et al.
Language Recognition Using Triplet Neural Networks
Victoria Mingote, Diego Castan, Mitchell McLaren et al.
Large Margin Softmax Loss for Speaker Verification
Yi Liu, Liang He, Jia Liu
Large Margin Training for Attention Based End-to-End Speech Recognition
Peidong Wang, Jia Cui, Chao Weng et al.
Large-Scale Mixed-Bandwidth Deep Neural Network Acoustic Modeling for Automatic Speech Recognition
Khoi-Nguyen C. Mac, Xiaodong Cui, Wei Zhang et al.