Papers
Dialect Recognition Based on Unsupervised Bottleneck Features
Qian Zhang, John H.L. Hansen
Dialogue as Collaborative Problem Solving
James Allen
“Did you laugh enough today?” — Deep Neural Networks for Mobile and Wearable Laughter Trackers
Gerhard Hagerer, Nicholas Cummins, Florian Eyben et al.
Direct Acoustics-to-Word Models for English Conversational Speech Recognition
Kartik Audhkhasi, Bhuvana Ramabhadran, George Saon et al.
Direct Modeling of Frequency Spectra and Waveform Generation Based on Phase Recovery for DNN-Based Speech Synthesis
Shinji Takaki, Hirokazu Kameoka, Junichi Yamagishi
Direct Modelling of Magnitude and Phase Spectra for Statistical Parametric Speech Synthesis
Felipe Espic, Cassia Valentini Botinhao, Simon King
Disambiguate or not? — The Role of Prosody in Unambiguous and Potentially Ambiguous Anaphora Production in Strictly Mandarin Parallel Structures
Luying Hou, Bert Le Bruyn, René Kager
Discovering Language in Marmoset Vocalization
Sakshi Verma, K.L. Prateek, Karthik Pandia et al.
Discrete Duration Model for Speech Synthesis
Bo Chen, Tianling Bian, Kai Yu
Discretized Continuous Speech Emotion Recognition with Multi-Task Deep Recurrent Neural Network
Duc Le, Zakaria Aldeneh, Emily Mower Provost
Discriminative Autoencoders for Acoustic Modeling
Ming-Han Yang, Hung-Shin Lee, Yu-Ding Lu et al.
Discussion
Björn Schuller, Anton Batliner
Distilling Knowledge from an Ensemble of Models for Punctuation Prediction
Jiangyan Yi, Jianhua Tao, Zhengqi Wen et al.
DNN-Based Feature Extraction and Classifier Combination for Child-Directed Speech, Cold and Snoring Identification
Gábor Gosztolya, Róbert Busa-Fekete, Tamás Grósz et al.
DNN-Based Ultrasound-to-Speech Conversion for a Silent Speech Interface
Tamás Gábor Csapó, Tamás Grósz, Gábor Gosztolya et al.
DNN Bottleneck Features for Speaker Clustering
Jesús Jorrín, Paola García, Luis Buera
DNN i-Vector Speaker Verification with Short, Text-Constrained Test Utterances
Jinghua Zhong, Wenping Hu, Frank K. Soong et al.
DNN-SPACE: DNN-HMM-Based Generative Model of Voice F0 Contours for Statistical Phrase/Accent Command Estimation
Nobukatsu Hojo, Yasuhito Ohsugi, Yusuke Ijima et al.
Does Posh English Sound Attractive?
Li Jiao, Chengxia Wang, Cristiane Hsu et al.
Domain Adaptation of PLDA Models in Broadcast Diarization by Means of Unsupervised Speaker Clustering
Ignacio Viñals, Alfonso Ortega, Jesús Villalba et al.
Domain-Independent User Satisfaction Reward Estimation for Dialogue Policy Learning
Stefan Ultes, Paweł Budzianowski, Iñigo Casanueva et al.
Domain Mismatch Modeling of Out-Domain i-Vectors for PLDA Speaker Verification
Md. Hafizur Rahman, Ivan Himawan, David Dean et al.
Domain-Specific Utterance End-Point Detection for Speech Recognition
Roland Maas, Ariya Rastrow, Kyle Goehner et al.
Dominant Distortion Classification for Pre-Processing of Vowels in Remote Biomedical Voice Analysis
Amir Hossein Poorjam, Jesper Rindom Jensen, Max A. Little et al.