Papers
Zero Resource Speech Synthesis Using Transcripts Derived from Perceptual Acoustic Units
Karthik Pandia D. S., Hema A. Murthy
Zooming in on Spatiotemporal V-to-C Coarticulation with Functional PCA
Michele Gubian, Manfred Pastätter, Marianne Pouplier
A Case Study on the Importance of Belief State Representation for Dialogue Policy Management
Margarita Kotti, Vassilios Diakoloukas, Alexandros Papangelis et al.
A Compact and Discriminative Feature Based on Auditory Summary Statistics for Acoustic Scene Classification
Hongwei Song, Jiqing Han, Shiwen Deng
A Comparative Study of Statistical Conversion of Face to Voice Based on Their Subjective Impressions
Yasuhito Ohsugi, Daisuke Saito, Nobuaki Minematsu
A Comparison of Input Types to a Deep Neural Network-based Forced Aligner
Matthew C. Kelley, Benjamin V. Tucker
A Comparison of Speaker-based and Utterance-based Data Selection for Text-to-Speech Synthesis
Kai-Zhan Lee, Erica Cooper, Julia Hirschberg
A Convolutional Recurrent Neural Network for Real-Time Speech Enhancement
Ke Tan, DeLiang Wang
Acoustic Analysis of Whispery Voice Disguise in Mandarin Chinese
Cuiling Zhang, Bin Li, Si Chen et al.
Acoustic and Perceptual Characteristics of Mandarin Speech in Homosexual and Heterosexual Male Speakers
Puyang Geng, Wentao Gu, Hiroya Fujisaki
Acoustic and Textual Data Augmentation for Improved ASR of Code-Switching Speech
Emre Yılmaz, Henk van den Heuvel, David van Leeuwen
Acoustic-dependent Phonemic Transcription for Text-to-speech Synthesis
Kévin Vythelingum, Yannick Estève, Olivier Rosec
Acoustic Features Associated with Sustained Vowel and Continuous Speech Productions by Chinese Children with Functional Articulation Disorders
Wang Zhang, Xiangquan Gui, Tianqi Wang et al.
Acoustic Modeling from Frequency Domain Representations of Speech
Pegah Ghahremani, Hossein Hadian, Hang Lv et al.
Acoustic Modeling Using Adversarially Trained Variational Recurrent Neural Network for Speech Synthesis
Joun Yeop Lee, Sung Jun Cheon, Byoung Jin Choi et al.
Acoustic Modeling with Densely Connected Residual Network for Multichannel Speech Recognition
Jian Tang, Yan Song, Lirong Dai et al.
Acoustic Modeling with DFSMN-CTC and Joint CTC-CE Learning
ShiLiang Zhang, Ming Lei
Acoustic-prosodic Entrainment in Structural Metadata Events
Vera Cabarrão, Fernando Batista, Helena Moniz et al.
Acoustic-Prosodic Indicators of Deception and Trust in Interview Dialogues
Sarah Ita Levitan, Angel Maredia, Julia Hirschberg
Active Learning for LF-MMI Trained Neural Networks in ASR
Yanhua Long, Hong Ye, Yijie Li et al.
Active Memory Networks for Language Modeling
Oscar Chen, Anton Ragni, Mark Gales et al.
Adding New Classes without Access to the Original Training Data with Applications to Language Identification
Hagai Taitelbaum, Ehud Ben-Reuven, Jacob Goldberger
A Deep Identity Representation for Noise Robust Spoofing Detection
Alejandro Gómez Alanís, Antonio M. Peinado, Jose A. Gonzalez et al.