Papers
8,761 papers found
On the Issue of Calibration in DNN-Based Speaker Recognition Systems
Mitchell McLaren, Diego Castan, Luciana Ferrer et al.
On the Role of Nonlinear Transformations in Deep Neural Network Acoustic Models
Tasha Nagamine, Michael L. Seltzer, Nima Mesgarani
On the Suitability of Vocalic Sandwiches in a Corpus-Based TTS Engine
David Guennec, Damien Lolive
On the Use of Gaussian Mixture Model Framework to Improve Speaker Adaptation of Deep Neural Network Acoustic Models
Natalia Tomashenko, Yuri Khokhlov, Yannick Estève
Open-Domain Audio-Visual Speech Recognition: A Deep Learning Approach
Yajie Miao, Florian Metze
Open Language Interface for Voice Exploitation (OLIVE)
Aaron Lawson, Mitchell McLaren, Harry Bratt et al.
Open Source Speech and Language Resources for Frisian
Emre Yılmaz, Henk van den Heuvel, Jelske Dijkstra et al.
Optimization of Speech Enhancement Front-End with Speech Recognition-Level Criterion
Takuya Higuchi, Takuya Yoshioka, Tomohiro Nakatani
Optimizing Speech Recognition Evaluation Using Stratified Sampling
Janne Pylkkönen, Thomas Drugman, Max Bisani
Out of Set Language Modelling in Hierarchical Language Identification
Saad Irtza, Vidhyasaharan Sethu, Sarith Fernando et al.
Overcoming Data Sparsity in Acoustic Modeling of Low-Resource Language by Borrowing Data and Model Parameters from High-Resource Languages
Basil Abraham, S. Umesh, Neethu Mariam Joy
Pair-Wise Distance Metric Learning of Neural Network Model for Spoken Language Identification
Xugang Lu, Peng Shen, Yu Tsao et al.
Parallel Dictionary Learning for Voice Conversion Using Discriminative Graph-Embedded Non-Negative Matrix Factorization
Ryo Aihara, Tetsuya Takiguchi, Yasuo Ariki
Parallel Speaker and Content Modelling for Text-Dependent Speaker Verification
Jianbo Ma, Saad Irtza, Kaavya Sriskandaraja et al.
Parkinson’s Disease Progression Assessment from Speech Using GMM-UBM
T. Arias-Vergara, J.C. Vasquez-Correa, Juan Rafael Orozco-Arroyave et al.
Part-of-Speech Tagging and Chunking in Text-to-Speech Synthesis for South African Languages
Georg I. Schlünz, Nkosikhona Dlamini, Rynhardt P. Kruger
Pause Prediction from Text for Speech Synthesis with User-Definable Pause Insertion Likelihood Threshold
Norbert Braunschweiler, Ranniery Maia
Perceived Usability and Cognitive Demand of Secondary Tasks in Spoken Versus Visual-Manual Automotive Interaction
Annika Silvervarg, Sofia Lindvall, Jonatan Andersson et al.
Perception of Tone in Whispered Mandarin Sentences: The Case for Singapore Mandarin
Yuling Gu, Boon Pang Lim, Nancy F. Chen
Perception Optimized Deep Denoising AutoEncoders for Speech Enhancement
Prashanth Gurunath Shivakumar, Panayiotis Georgiou
Perceptual Lateralization of Coda Rhotic Production in Puerto Rican Spanish
Mairym Lloréns Monteserín, Shrikanth S. Narayanan, Louis Goldstein
Perceptual Salience of Voice Source Parameters in Signaling Focal Prominence
Irena Yanushevskaya, Andy Murphy, Christer Gobl et al.