Research Explorer

Multi-channel Attention for End-to-End Speech Recognition

Stefan Braun, Daniel Neil, Jithendar Anumula et al.

2018 INTERSPEECH

Multicomponent 2-D AM-FM Modeling of Speech Spectrograms

Jitendra Kumar Dhiman, Neeraj Sharma, Chandra Sekhar Seelamantula

2018 INTERSPEECH

Multi-frame Coding of LSF Parameters Using Block-Constrained Trellis Coded Vector Quantization

Yaxing Li, Shan Xu, Shengwu Xiong et al.

2018 INTERSPEECH

Multi-frame Quantization of LSF Parameters Using a Deep Autoencoder and Pyramid Vector Quantizer

Yaxing Li, Eshete Derb Emiru, Shengwu Xiong et al.

2018 INTERSPEECH

Multi-Head Decoder for End-to-End Speech Recognition

Tomoki Hayashi, Shinji Watanabe, Tomoki Toda et al.

2018 INTERSPEECH

Multilingual Bottleneck Features for Subword Modeling in Zero-resource Languages

Enno Hermann, Sharon Goldwater

2018 INTERSPEECH

Multilingual Deep Neural Network Training Using Cyclical Learning Rate

Andreas Søeborg Kirkedal, Yeon-Jun Kim

2018 INTERSPEECH

Multi-Lingual Depression-Level Assessment from Conversational Speech Using Acoustic and Text Features

Yasin Özkanca, Cenk Demiroglu, Aslı Besirli et al.

2018 INTERSPEECH

Multilingual Grapheme-to-Phoneme Conversion with Global Character Vectors

Jinfu Ni, Yoshinori Shiga, Hisashi Kawai

2018 INTERSPEECH

Multilingual Neural Network Acoustic Modelling for ASR of Under-Resourced English-isiZulu Code-Switched Speech

Astik Biswas, Febe de Wet, Ewald van der Westhuizen et al.

2018 INTERSPEECH

Multi-modal Attention Mechanisms in LSTM and Its Application to Acoustic Scene Classification

Teng Zhang, Kailai Zhang, Ji Wu

2018 INTERSPEECH

Multi-Modal Data Augmentation for End-to-end ASR

Adithya Renduchintala, Shuoyang Ding, Matthew Wiesner et al.

2018 INTERSPEECH

Multimodal I-vectors to Detect and Evaluate Parkinson's Disease

Nicanor Garcia, Juan Camilo Vásquez Correa, Juan Rafael Orozco-Arroyave et al.

2018 INTERSPEECH

Multimodal Name Recognition in Live TV Subtitling

Marek Hrúz, Aleš Pražák, Michal Bušta

2018 INTERSPEECH

Multimodal Polynomial Fusion for Detecting Driver Distraction

Yulun Du, Alan W Black, Louis-Philippe Morency et al.

2018 INTERSPEECH

Multimodal Speaker Segmentation and Diarization Using Lexical and Acoustic Cues via Sequence to Sequence Neural Networks

Tae Jin Park, Panayiotis Georgiou

2018 INTERSPEECH

Multimodal Speech Synthesis Architecture for Unsupervised Speaker Adaptation

Hieu-Thi Luong, Junichi Yamagishi

2018 INTERSPEECH

Multiple Concurrent Sound Source Tracking Based on Observation-Guided Adaptive Particle Filter

Hong Liu, Haipeng Lan, Bing Yang et al.

2018 INTERSPEECH

Multiple Instance Deep Learning for Weakly Supervised Small-Footprint Audio Event Detection

Shao-Yen Tseng, Juncheng Li, Yun Wang et al.

2018 INTERSPEECH

Multiple Phase Information Combination for Replay Attacks Detection

Dongbo Li, Longbiao Wang, Jianwu Dang et al.

2018 INTERSPEECH

Multi-resolution Gammachirp Envelope Distortion Index for Intelligibility Prediction of Noisy Speech

Katsuhiko Yamamoto, Toshio Irino, Narumi Ohashi et al.

2018 INTERSPEECH

Multi-talker Speech Separation Based on Permutation Invariant Training and Beamforming

Lu Yin, Ziteng Wang, Risheng Xia et al.

2018 INTERSPEECH

Multi-target Voice Conversion without Parallel Data by Adversarially Learning Disentangled Audio Representations

Ju-chieh Chou, Cheng-chieh Yeh, Hung-yi Lee et al.

2018 INTERSPEECH

Multi-Task Learning of Speech Recognition and Speech Synthesis Parameters for Ultrasound-based Silent Speech Interfaces

László Tóth, Gábor Gosztolya, Tamás Grósz et al.

2018 INTERSPEECH

Multi-task Learning with Augmentation Strategy for Acoustic-to-word Attention-based Encoder-decoder Speech Recognition

Takafumi Moriya, Sei Ueno, Yusuke Shinohara et al.

2018 INTERSPEECH

Papers