Papers
A Compact and Discriminative Feature Based on Auditory Summary Statistics for Acoustic Scene Classification
Hongwei Song, Jiqing Han, Shiwen Deng
A Comparative Study of Statistical Conversion of Face to Voice Based on Their Subjective Impressions
Yasuhito Ohsugi, Daisuke Saito, Nobuaki Minematsu
A Comparison of Input Types to a Deep Neural Network-based Forced Aligner
Matthew C. Kelley, Benjamin V. Tucker
A Comparison of Speaker-based and Utterance-based Data Selection for Text-to-Speech Synthesis
Kai-Zhan Lee, Erica Cooper, Julia Hirschberg
A Convolutional Recurrent Neural Network for Real-Time Speech Enhancement
Ke Tan, DeLiang Wang
Acoustic Analysis of Whispery Voice Disguise in Mandarin Chinese
Cuiling Zhang, Bin Li, Si Chen et al.
Acoustic and Perceptual Characteristics of Mandarin Speech in Homosexual and Heterosexual Male Speakers
Puyang Geng, Wentao Gu, Hiroya Fujisaki
Acoustic and Textual Data Augmentation for Improved ASR of Code-Switching Speech
Emre Yılmaz, Henk van den Heuvel, David van Leeuwen
Acoustic-dependent Phonemic Transcription for Text-to-speech Synthesis
Kévin Vythelingum, Yannick Estève, Olivier Rosec
Acoustic Features Associated with Sustained Vowel and Continuous Speech Productions by Chinese Children with Functional Articulation Disorders
Wang Zhang, Xiangquan Gui, Tianqi Wang et al.
Acoustic Modeling from Frequency Domain Representations of Speech
Pegah Ghahremani, Hossein Hadian, Hang Lv et al.
Acoustic Modeling Using Adversarially Trained Variational Recurrent Neural Network for Speech Synthesis
Joun Yeop Lee, Sung Jun Cheon, Byoung Jin Choi et al.
Acoustic Modeling with Densely Connected Residual Network for Multichannel Speech Recognition
Jian Tang, Yan Song, Lirong Dai et al.
Acoustic Modeling with DFSMN-CTC and Joint CTC-CE Learning
ShiLiang Zhang, Ming Lei
Acoustic-prosodic Entrainment in Structural Metadata Events
Vera Cabarrão, Fernando Batista, Helena Moniz et al.
Acoustic-Prosodic Indicators of Deception and Trust in Interview Dialogues
Sarah Ita Levitan, Angel Maredia, Julia Hirschberg
Active Learning for LF-MMI Trained Neural Networks in ASR
Yanhua Long, Hong Ye, Yijie Li et al.
Active Memory Networks for Language Modeling
Oscar Chen, Anton Ragni, Mark Gales et al.
Adding New Classes without Access to the Original Training Data with Applications to Language Identification
Hagai Taitelbaum, Ehud Ben-Reuven, Jacob Goldberger
A Deep Identity Representation for Noise Robust Spoofing Detection
Alejandro Gómez Alanís, Antonio M. Peinado, Jose A. Gonzalez et al.
A Deep Learning Approach to Assessing Non-native Pronunciation of English Using Phone Distances
Konstantinos Kyriakopoulos, Kate Knill, Mark Gales
A Deep Learning Method for Pathological Voice Detection Using Convolutional Deep Belief Networks
Huiyi Wu, John Soraghan, Anja Lowit et al.
A Deep Neural Network Based Harmonic Noise Model for Speech Enhancement
Zhiheng Ouyang, Hongjiang Yu, Wei-Ping Zhu et al.
A Deep Reinforcement Learning Based Multimodal Coaching Model (DCM) for Slot Filling in Spoken Language Understanding(SLU)
Yu Wang, Abhishek Patel, Yilin Shen et al.