Papers
8,761 papers found
Modeling Time-Frequency Patterns with LSTM vs. Convolutional Architectures for LVCSR Tasks
Tara N. Sainath, Bo Li
Model Integration for HMM- and DNN-Based Speech Synthesis Using Product-of-Experts Framework
Kentaro Tachibana, Tomoki Toda, Yoshinori Shiga et al.
Modulation Enhancement of Temporal Envelopes for Increasing Speech Intelligibility in Noise
Maria Koutsogiannaki, Yannis Stylianou
Modulation Spectral Features for Predicting Vocal Emotion Recognition by Simulated Cochlear Implants
Zhi Zhu, Ryota Miyauchi, Yukiko Araki et al.
Monaural Source Separation Using a Random Forest Classifier
Cosimo Riday, Saurabh Bhargava, Richard H.R. Hahnloser et al.
Multi-Attribute Factorized Hidden Layer Adaptation for DNN Acoustic Models
Lahiru Samarakoon, Khe Chai Sim
Multi-Channel Linear Prediction Based on Binaural Coherence for Speech Dereverberation
Hong Liu, Xiuling Wang, Miao Sun et al.
Multichannel Spatial Clustering for Robust Far-Field Automatic Speech Recognition in Mismatched Conditions
Michael I. Mandel, Jon Barker
Multidimensional Residual Learning Based on Recurrent Neural Networks for Acoustic Modeling
Yuanyuan Zhao, Shuang Xu, Bo Xu
Multi-Domain Joint Semantic Frame Parsing Using Bi-Directional RNN-LSTM
Dilek Hakkani-Tür, Gokhan Tur, Asli Celikyilmaz et al.
Multi-Language Neural Network Language Models
Anton Ragni, Edgar Dakin, Xie Chen et al.
Multilingual Data Selection for Low Resource Speech Recognition
Samuel Thomas, Kartik Audhkhasi, Jia Cui et al.
Multilingual Speech Emotion Recognition System Based on a Three-Layer Model
Xingfeng Li, Masato Akagi
Multimodal Fusion of Multirate Acoustic, Prosodic, and Lexical Speaker Characteristics for Native Language Identification
Prashanth Gurunath Shivakumar, Sandeep Nallan Chakravarthula, Panayiotis Georgiou
Multiple Influences on Vocabulary Acquisition: Parental Input Dominates
Dominic W. Massaro
Multi-Talker Speech Recognition Based on Blind Source Separation with ad hoc Microphone Array Using Smartphones and Cloud Storage
Keiko Ochi, Nobutaka Ono, Shigeki Miyabe et al.
Multi-Task Learning and Weighted Cross-Entropy for DNN-Based Keyword Spotting
Sankaran Panchapagesan, Ming Sun, Aparna Khare et al.
My-Own-Voice: A Web Service That Allows You to Create a Text-to-Speech Voice From Your Own Voice
Fabrice Malfrere, Olivier Deroo, Emmanuelle Franques et al.
Native Language Detection Using the I-Vector Framework
Mohammed Senoussaoui, Patrick Cardinal, Najim Dehak et al.
Native Language Identification Using Spectral and Source-Based Features
Avni Rajpal, Tanvina B. Patel, Hardik B. Sailor et al.
Naturalness Judgement of L2 English Through Dubbing Practice
Dean Luo, Ruxin Luo, Lixin Wang
Neural Network Adaptive Beamforming for Robust Multichannel Speech Recognition
Bo Li, Tara N. Sainath, Ron J. Weiss et al.