Papers
8,761 papers found
Discovering Language in Marmoset Vocalization
Sakshi Verma, K.L. Prateek, Karthik Pandia et al.
Discrete Duration Model for Speech Synthesis
Bo Chen, Tianling Bian, Kai Yu
Discretized Continuous Speech Emotion Recognition with Multi-Task Deep Recurrent Neural Network
Duc Le, Zakaria Aldeneh, Emily Mower Provost
Discriminative Autoencoders for Acoustic Modeling
Ming-Han Yang, Hung-Shin Lee, Yu-Ding Lu et al.
Distilling Knowledge from an Ensemble of Models for Punctuation Prediction
Jiangyan Yi, Jianhua Tao, Zhengqi Wen et al.
DNN-Based Feature Extraction and Classifier Combination for Child-Directed Speech, Cold and Snoring Identification
Gábor Gosztolya, Róbert Busa-Fekete, Tamás Grósz et al.
DNN-Based Ultrasound-to-Speech Conversion for a Silent Speech Interface
Tamás Gábor Csapó, Tamás Grósz, Gábor Gosztolya et al.
DNN Bottleneck Features for Speaker Clustering
Jesús Jorrín, Paola García, Luis Buera
DNN i-Vector Speaker Verification with Short, Text-Constrained Test Utterances
Jinghua Zhong, Wenping Hu, Frank K. Soong et al.
DNN-SPACE: DNN-HMM-Based Generative Model of Voice F0Contours for Statistical Phrase/Accent Command Estimation
Nobukatsu Hojo, Yasuhito Ohsugi, Yusuke Ijima et al.
Does Posh English Sound Attractive?
Li Jiao, Chengxia Wang, Cristiane Hsu et al.
Domain Adaptation of PLDA Models in Broadcast Diarization by Means of Unsupervised Speaker Clustering
Ignacio Viñals, Alfonso Ortega, Jesús Villalba et al.
Domain-Independent User Satisfaction Reward Estimation for Dialogue Policy Learning
Stefan Ultes, Paweł Budzianowski, Iñigo Casanueva et al.
Domain Mismatch Modeling of Out-Domain i-Vectors for PLDA Speaker Verification
Md. Hafizur Rahman, Ivan Himawan, David Dean et al.
Domain-Specific Utterance End-Point Detection for Speech Recognition
Roland Maas, Ariya Rastrow, Kyle Goehner et al.
Dominant Distortion Classification for Pre-Processing of Vowels in Remote Biomedical Voice Analysis
Amir Hossein Poorjam, Jesper Rindom Jensen, Max A. Little et al.
Don’t Count on ASR to Transcribe for You: Breaking Bias with Two Crowds
Michael Levit, Yan Huang, Shuangyu Chang et al.
Duration Mismatch Compensation Using Four-Covariance Model and Deep Neural Network for Speaker Verification
Pierre-Michel Bousquet, Mickael Rouvier
Dynamic Layer Normalization for Adaptive Neural Acoustic Modeling in Speech Recognition
Taesup Kim, Inchul Song, Yoshua Bengio
Dysprosody Differentiate Between Parkinson’s Disease, Progressive Supranuclear Palsy, and Multiple System Atrophy
Jan Hlavnička, Tereza Tykalová, Roman Čmejla et al.
Earlier Identification of Children with Autism Spectrum Disorder: An Automatic Vocalisation-Based Approach
Florian B. Pokorny, Björn Schuller, Peter B. Marschik et al.
Effectively Building Tera Scale MaxEnt Language Models Incorporating Non-Linguistic Signals
Fadi Biadsy, Mohammadreza Ghodsi, Diamantino Caseiro
Effect of Formant and F0 Discontinuity on Perceived Vowel Duration: Impacts for Concatenative Speech Synthesis
Tomáš Bořil, Pavel Šturm, Radek Skarnitzl et al.
Effect of Language, Speaking Style and Speaker on Long-Term F0 Estimation
Pablo Arantes, Anders Eriksson, Suska Gutzeit