Papers
Multi-resolution Gammachirp Envelope Distortion Index for Intelligibility Prediction of Noisy Speech
Katsuhiko Yamamoto, Toshio Irino, Narumi Ohashi et al.
Multi-talker Speech Separation Based on Permutation Invariant Training and Beamforming
Lu Yin, Ziteng Wang, Risheng Xia et al.
Multi-target Voice Conversion without Parallel Data by Adversarially Learning Disentangled Audio Representations
Ju-chieh Chou, Cheng-chieh Yeh, Hung-yi Lee et al.
Multi-Task Learning of Speech Recognition and Speech Synthesis Parameters for Ultrasound-based Silent Speech Interfaces
László Tóth, Gábor Gosztolya, Tamás Grósz et al.
Multi-task Learning with Augmentation Strategy for Acoustic-to-word Attention-based Encoder-decoder Speech Recognition
Takafumi Moriya, Sei Ueno, Yusuke Shinohara et al.
Music Genre Recognition Using Deep Neural Networks and Transfer Learning
Deepanway Ghosal, Maheshkumar H. Kolekar
Music Source Activity Detection and Separation Using Deep Attractor Network
Rajath Kumar, Yi Luo, Nima Mesgarani
Naturalness Improvement Algorithm for Reconstructed Glossectomy Patient's Speech Using Spectral Differential Modification in Voice Conversion
Hiroki Murakami, Sunao Hara, Masanobu Abe et al.
Neural Error Corrective Language Models for Automatic Speech Recognition
Tomohiro Tanaka, Ryo Masumura, Hirokazu Masataki et al.
Neural Language Codes for Multilingual Acoustic Models
Markus Müller, Sebastian Stüker, Alex Waibel
Neural MultiVoice Models for Expressing Novel Personalities in Dialog
Shereen Oraby, Lena Reed, Sharath T.S. et al.
Neural Response Development During Distributional Learning
Natalie Boll-Avetisyan, Jessie S. Nixon, Tomas O. Lentz et al.
Neural Speech Turn Segmentation and Affinity Propagation for Speaker Diarization
Ruiqing Yin, Hervé Bredin, Claude Barras
Noise Robust Acoustic to Articulatory Speech Inversion
Nadee Seneviratne, Ganesh Sivaraman, Vikramjit Mitra et al.
Non-Uniform Spectral Smoothing for Robust Children's Speech Recognition
Ishwar Chandra Yadav, Avinash Kumar, Syed Shahnawazuddin et al.
Novel Empirical Mode Decomposition Cepstral Features for Replay Spoof Detection
Prasad Tapkir, Hemant Patil
Novel Linear Frequency Residual Cepstral Features for Replay Attack Detection
Hemlata Tak, Hemant Patil
On Convolutional LSTM Modeling for Joint Wake-Word Detection and Text Dependent Speaker Verification
Rajath Kumar, Vaishnavi Yeruva, Sriram Ganapathy
On Enhancing Speech Emotion Recognition Using Generative Adversarial Networks
Saurabh Sahu, Rahul Gupta, Carol Espy-Wilson
On Learning to Identify Genders from Raw Speech Signal Using CNNs
Selen Hande Kabil, Hannah Muckenhirn, Mathew Magimai.-Doss
On Learning Vocal Tract System Related Speaker Discriminative Information from Raw Signal Using CNNs
Hannah Muckenhirn, Mathew Magimai.-Doss, Sebastien Marcel