Papers
Acoustic Word Embeddings for ASR Error Detection
Sahar Ghannay, Yannick Estève, Nathalie Camelin et al.
Active and Semi-Supervised Learning in ASR: Benefits on the Acoustic and Language Models
Thomas Drugman, Janne Pylkkönen, Reinhard Kneser
Adaptation of Neural Networks Constrained by Prior Statistics of Node Co-Activations
Tasha Nagamine, Zhuo Chen, Nima Mesgarani
Adaptive Group Sparsity for Non-Negative Matrix Factorization with Application to Unsupervised Source Separation
Xu Li, Ziteng Wang, Xiaofei Wang et al.
Adaptive Latency for Part-of-Speech Tagging in Incremental Text-to-Speech Synthesis
Maël Pouget, Olha Nahorna, Thomas Hueber et al.
A Deep Learning Approach to Modeling Empathy in Addiction Counseling
James Gibson, Doğan Can, Bo Xiao et al.
A Divide-and-Conquer Approach for Language Identification Based on Recurrent Neural Networks
G. Gelly, Jean-Luc Gauvain, V.B. Le et al.
A DNN-HMM Approach to Non-Negative Matrix Factorization Based Speech Enhancement
Ziteng Wang, Xu Li, Xiaofei Wang et al.
A DNN-HMM Approach to Story Segmentation
Jia Yu, Xiong Xiao, Lei Xie et al.
Advances in Very Deep Convolutional Neural Networks for LVCSR
Tom Sercu, Vaibhava Goel
A Fast and Accurate Fundamental Frequency Estimator Using Recursive Moving Average Filters
Ryunosuke Daido, Yuji Hisaminato
A Feature Normalisation Technique for PLLR Based Language Identification Systems
Sarith Fernando, Vidhyasaharan Sethu, Eliathamby Ambikairajah
A Feature Study for Masking-Based Reverberant Speech Separation
Masood Delfarah, DeLiang Wang
A Framework for Automated Marmoset Vocalization Detection and Classification
Alan Wisler, Laura J. Brattain, Rogier Landman et al.
A Framework for Practical Multistream ASR
Sri Harish Mallidi, Hynek Hermansky
A French Corpus for Distant-Microphone Speech Processing in Real Homes
Nancy Bertin, Ewen Camberlein, Emmanuel Vincent et al.
A Hierarchical Predictor of Synthetic Speech Naturalness Using Neural Networks
Takenori Yoshimura, Gustav Eje Henter, Oliver Watts et al.
A Hybrid System for Continuous Word-Level Emphasis Modeling Based on HMM State Clustering and Adaptive Training
Quoc Truong Do, Tomoki Toda, Graham Neubig et al.
A KL Divergence and DNN-Based Approach to Voice Conversion without Parallel Training Sentences
Feng-Long Xie, Frank K. Soong, Haifeng Li
A Longitudinal Study of Children’s Intonation in Narrative Speech
Jeffrey Kallay, Melissa A. Redford
A Multimodal Dialogue System for Air Traffic Control Trainees Based on Discrete-Event Simulation
Luboš Šmídl, Adam Chýlek, Jan Švec
An Acoustic Analysis of Child-Child and Child-Robot Interactions for Understanding Engagement during Speech-Controlled Computer Games
Theodora Chaspari, Jill Fain Lehman
An Acoustic Analysis of /r/ in Tyrolean
Vincenzo Galatà, Lorenzo Spreafico, Alessandro Vietti et al.