Papers
Multimodal Prediction of Affective Dimensions via Fusing Multiple Regression Techniques
D.-Y. Huang, Wan Ding, Mingyu Xu et al.
Multiple Sound Source Counting and Localization Based on Spatial Principal Eigenvector
Bing Yang, Hong Liu, Cheng Pang
Multi-Scale Context Adaptation for Improving Child Automatic Speech Recognition in Child-Adult Spoken Interactions
Manoj Kumar, Daniel Bone, Kelly McWilliams et al.
Multi-Stage DNN Training for Automatic Recognition of Dysarthric Speech
Emre Yılmaz, Mario Ganzeboom, Catia Cucchiarini et al.
Multi-Target Ensemble Learning for Monaural Speech Separation
Hui Zhang, Xueliang Zhang, Guanglai Gao
Multi-Task Learning for Mispronunciation Detection on Singapore Children’s Mandarin Speech
Rong Tong, Nancy F. Chen, Bin Ma
Multi-Task Learning for Prosodic Structure Generation Using BLSTM RNN with Structured Output Layer
Yuchen Huang, Zhiyong Wu, Runnan Li et al.
Multi-Task Learning Using Mismatched Transcription for Under-Resourced Speech Recognition
Van Hai Do, Nancy F. Chen, Boon Pang Lim et al.
Multitask Learning with CTC and Segmental CRF for Speech Recognition
Liang Lu, Lingpeng Kong, Chris Dyer et al.
Multitask Learning with Low-Level Auxiliary Tasks for Encoder-Decoder Based Speech Recognition
Shubham Toshniwal, Hao Tang, Liang Lu et al.
Multitask Sequence-to-Sequence Models for Grapheme-to-Phoneme Conversion
Benjamin Milde, Christoph Schmidt, Joachim Köhler
Multiview Representation Learning via Deep CCA for Silent Speech Recognition
Myungjong Kim, Beiming Cao, Ted Mau et al.
Musical Speech: A New Methodology for Transcribing Speech Prosody
Alexsandro R. Meireles, Antônio R.M. Simões, Antonio Celso Ribeiro et al.
Music Tempo Estimation Using Sub-Band Synchrony
Shreyan Chowdhury, Tanaya Guha, Rajesh M. Hegde
Mylly — The Mill: A New Platform for Processing Speech and Text Corpora Easily and Efficiently
Mietta Lennes, Jussi Piitulainen, Martin Matthiesen
Nativization of Foreign Names in TTS for Automatic Reading of World News in Swahili
Joseph Mendelson, Pilar Oplustil, Oliver Watts et al.
Nature of Contrast and Coarticulation: Evidence from Mizo Tones and Assamese Vowel Harmony
Indranil Dutta, Irfan S., Pamir Gogoi et al.
Neural Network-Based Spectrum Estimation for Online WPE Dereverberation
Keisuke Kinoshita, Marc Delcroix, Haeyong Kwon et al.
Neural Speech Recognizer: Acoustic-to-Word LSTM Model for Large Vocabulary Speech Recognition
Hagen Soltau, Hank Liao, Haşim Sak
NMT-Based Segmentation and Punctuation Insertion for Real-Time Spoken Language Translation
Eunah Cho, Jan Niehues, Alex Waibel
Node Pruning Based on Entropy of Weights and Node Activity for Small-Footprint Acoustic Model Based on Deep Neural Networks
Ryu Takeda, Kazuhiro Nakadai, Kazunori Komatani
Non-Local Estimation of Speech Signal for Vowel Onset Point Detection in Varied Environments
Avinash Kumar, S. Shahnawazuddin, Gayadhar Pradhan
Nonparametrically Trained Probabilistic Linear Discriminant Analysis for i-Vector Speaker Verification
Abbas Khosravani, Mohammad Mehdi Homayounpour
Non-Uniform MCE Training of Deep Long Short-Term Memory Recurrent Neural Networks for Keyword Spotting
Zhong Meng, Biing-Hwang Juang
Nora the Empathetic Psychologist
Genta Indra Winata, Onno Kampman, Yang Yang et al.