Papers
Multi-channel Attention for End-to-End Speech Recognition
Stefan Braun, Daniel Neil, Jithendar Anumula et al.
Multicomponent 2-D AM-FM Modeling of Speech Spectrograms
Jitendra Kumar Dhiman, Neeraj Sharma, Chandra Sekhar Seelamantula
Multi-frame Coding of LSF Parameters Using Block-Constrained Trellis Coded Vector Quantization
Yaxing Li, Shan Xu, Shengwu Xiong et al.
Multi-frame Quantization of LSF Parameters Using a Deep Autoencoder and Pyramid Vector Quantizer
Yaxing Li, Eshete Derb Emiru, Shengwu Xiong et al.
Multi-Head Decoder for End-to-End Speech Recognition
Tomoki Hayashi, Shinji Watanabe, Tomoki Toda et al.
Multilingual Bottleneck Features for Subword Modeling in Zero-resource Languages
Enno Hermann, Sharon Goldwater
Multilingual Deep Neural Network Training Using Cyclical Learning Rate
Andreas Søeborg Kirkedal, Yeon-Jun Kim
Multi-Lingual Depression-Level Assessment from Conversational Speech Using Acoustic and Text Features
Yasin Özkanca, Cenk Demiroglu, Aslı Besirli et al.
Multilingual Grapheme-to-Phoneme Conversion with Global Character Vectors
Jinfu Ni, Yoshinori Shiga, Hisashi Kawai
Multilingual Neural Network Acoustic Modelling for ASR of Under-Resourced English-isiZulu Code-Switched Speech
Astik Biswas, Febe de Wet, Ewald van der Westhuizen et al.
Multi-modal Attention Mechanisms in LSTM and Its Application to Acoustic Scene Classification
Teng Zhang, Kailai Zhang, Ji Wu
Multi-Modal Data Augmentation for End-to-end ASR
Adithya Renduchintala, Shuoyang Ding, Matthew Wiesner et al.
Multimodal I-vectors to Detect and Evaluate Parkinson's Disease
Nicanor Garcia, Juan Camilo Vásquez Correa, Juan Rafael Orozco-Arroyave et al.
Multimodal Name Recognition in Live TV Subtitling
Marek Hrúz, Aleš Pražák, Michal Bušta
Multimodal Polynomial Fusion for Detecting Driver Distraction
Yulun Du, Alan W Black, Louis-Philippe Morency et al.
Multimodal Speaker Segmentation and Diarization Using Lexical and Acoustic Cues via Sequence to Sequence Neural Networks
Tae Jin Park, Panayiotis Georgiou
Multimodal Speech Synthesis Architecture for Unsupervised Speaker Adaptation
Hieu-Thi Luong, Junichi Yamagishi
Multiple Concurrent Sound Source Tracking Based on Observation-Guided Adaptive Particle Filter
Hong Liu, Haipeng Lan, Bing Yang et al.
Multiple Instance Deep Learning for Weakly Supervised Small-Footprint Audio Event Detection
Shao-Yen Tseng, Juncheng Li, Yun Wang et al.
Multiple Phase Information Combination for Replay Attacks Detection
Dongbo Li, Longbiao Wang, Jianwu Dang et al.
Multi-resolution Gammachirp Envelope Distortion Index for Intelligibility Prediction of Noisy Speech
Katsuhiko Yamamoto, Toshio Irino, Narumi Ohashi et al.
Multi-talker Speech Separation Based on Permutation Invariant Training and Beamforming
Lu Yin, Ziteng Wang, Risheng Xia et al.
Multi-target Voice Conversion without Parallel Data by Adversarially Learning Disentangled Audio Representations
Ju-chieh Chou, Cheng-chieh Yeh, Hung-yi Lee et al.
Multi-Task Learning of Speech Recognition and Speech Synthesis Parameters for Ultrasound-based Silent Speech Interfaces
László Tóth, Gábor Gosztolya, Tamás Grósz et al.
Multi-task Learning with Augmentation Strategy for Acoustic-to-word Attention-based Encoder-decoder Speech Recognition
Takafumi Moriya, Sei Ueno, Yusuke Shinohara et al.