Papers
Fusion Strategies for Robust Speech Recognition and Keyword Spotting for Channel- and Noise-Degraded Speech
Vikramjit Mitra, Julien VanHout, Wen Wang et al.
Future Context Attention for Unidirectional LSTM Based Acoustic Model
Jian Tang, Shiliang Zhang, Si Wei et al.
Gating Recurrent Enhanced Memory Neural Networks on Language Identification
Wang Geng, Yuanyuan Zhao, Wenfu Wang et al.
Generalized Discriminant Analysis (GDA) for Improved i-Vector Based Speaker Recognition
Fahimeh Bahmaninezhad, John H.L. Hansen
Generalizing Steady State Suppression for Enhanced Intelligibility Under Reverberation
Petko N. Petkov, Yannis Stylianou
Generating Complementary Acoustic Model Spaces in DNN-Based Sequence-to-Frame DTW Scheme for Out-of-Vocabulary Spoken Term Detection
Shi-wook Lee, Kazuyo Tanaka, Yoshiaki Itoh
Generating Gestural Scores from Acoustics Through a Sparse Anchor-Based Representation of Speech
Christopher Liberatore, Ricardo Gutierrez-Osuna
Generating Natural Video Descriptions via Multimodal Processing
Qin Jin, Junwei Liang, Xiaozhu Lin
Generation and Pruning of Pronunciation Variants to Improve ASR Accuracy
Zhenhao Ge, Aravind Ganapathiraju, Ananth N. Iyer et al.
Generation of Emotion Control Vector Using MDS-Based Space Transformation for Expressive Speech Synthesis
Yan-You Chen, Chung-Hsien Wu, Yu-Fong Huang
Generative Acoustic-Phonemic-Speaker Model Based on Three-Way Restricted Boltzmann Machine
Toru Nakashika, Yasuhiro Minami
Glimpse-Based Metrics for Predicting Speech Intelligibility in Additive Noise Conditions
Yan Tang, Martin Cooke
Glottal Squeaks in VC Sequences
Míša Hejná, Pertti Palo, Scott Moisik
GlottDNN — A Full-Band Glottal Vocoder for Statistical Parametric Speech Synthesis
Manu Airaksinen, Bajibabu Bollepalli, Lauri Juvela et al.
GMM-Free Flat Start Sequence-Discriminative DNN Training
Gábor Gosztolya, Tamás Grósz, László Tóth
HAPPY Team Entry to NIST OpenSAD Challenge: A Fusion of Short-Term Unsupervised and Segment i-Vector Based Speech Activity Detectors
Tomi Kinnunen, Alexey Sholokhov, Elie Khoury et al.
Head Motion Generation with Synthetic Speech: A Data Driven Approach
Najmeh Sadoughi, Carlos Busso
Hierarchical Classification of Speaker and Background Noise and Estimation of SNR Using Sparse Representation
K.V. Vijay Girish, A.G. Ramakrishnan, T.V. Ananthapadmanabha
Highlighting Psychological Features for Predicting Child Interjections During Story Telling
Gaël Lejeune, François Rioult, Bruno Crémilleux
HMM-Based Non-Native Accent Assessment Using Posterior Features
Ramya Rasipuram, Milos Cernak, Mathew Magimai-Doss
HMM-Based Speech Enhancement Using Sub-Word Models and Noise Adaptation
Akihiro Kato, Ben Milner
How Neural Network Depth Compensates for HMM Conditional Independence Assumptions in DNN-HMM Acoustic Models
Suman Ravuri, Steven Wegmann
Hybrid Accelerated Optimization for Speech Recognition
Jen-Tzung Chien, Pei-Wen Huang, Tan Lee
Hybrid Dialogue State Tracking for Real World Human-to-Human Dialogues
Kai Sun, Su Zhu, Lu Chen et al.