Papers
8,761 papers found
Deep Learning Techniques in Tandem with Signal Processing Cues for Phonetic Segmentation for Text to Speech Synthesis in Indian Languages
Arun Baby, Jeena J. Prakash, Rupak Vignesh et al.
Deep Least Squares Regression for Speaker Adaptation
Younggwan Kim, Hyungjun Lim, Jahyun Goo et al.
Deep Neural Factorization for Speech Recognition
Jen-Tzung Chien, Chen Shen
Deep Neural Network Embeddings for Text-Independent Speaker Verification
David Snyder, Daniel Garcia-Romero, Daniel Povey et al.
Deep Recurrent Neural Network Based Monaural Speech Separation Using Recurrent Temporal Restricted Boltzmann Machines
Suman Samui, Indrajit Chakrabarti, Soumya K. Ghosh
Deep Reinforcement Learning of Dialogue Policies with Less Weight Updates
Heriberto Cuayáhuitl, Seunghak Yu
Deep Speaker Embeddings for Short-Duration Speaker Verification
Gautam Bhattacharya, Jahangir Alam, Patrick Kenny
Deep Speaker Feature Learning for Text-Independent Speaker Verification
Lantian Li, Yixiang Chen, Ying Shi et al.
Denoising Recurrent Neural Network for Deep Bidirectional LSTM Based Voice Conversion
Jie Wu, D.-Y. Huang, Lei Xie et al.
Depression Detection Using Automatic Transcriptions of De-Identified Speech
Paula Lopez-Otero, Laura Docio-Fernandez, Alberto Abad et al.
Detecting Overlapped Speech on Short Timeframes Using Deep Learning
Valentin Andrei, Horia Cucu, Corneliu Burileanu
Detection of Mispronunciations and Disfluencies in Children Reading Aloud
Jorge Proença, Carla Lopes, Michael Tjalve et al.
Detection of Replay Attacks Using Single Frequency Filtering Cepstral Coefficients
K.N.R.K. Raju Alluri, Sivanand Achanta, Sudarsana Reddy Kadiri et al.
Developing an Embosi (Bantu C25) Speech Variant Dictionary to Model Vowel Elision and Morpheme Deletion
Jamison Cooper-Leavitt, Lori Lamel, Annie Rialland et al.
Developing On-Line Speaker Diarization System
Dimitrios Dimitriadis, Petr Fousek
Dialect Perception by Older Children
Ewa Jacewicz, Robert A. Fox
Dialect Recognition Based on Unsupervised Bottleneck Features
Qian Zhang, John H.L. Hansen
Dialogue as Collaborative Problem Solving
James Allen
“Did you laugh enough today?” — Deep Neural Networks for Mobile and Wearable Laughter Trackers
Gerhard Hagerer, Nicholas Cummins, Florian Eyben et al.
Direct Acoustics-to-Word Models for English Conversational Speech Recognition
Kartik Audhkhasi, Bhuvana Ramabhadran, George Saon et al.
Direct Modeling of Frequency Spectra and Waveform Generation Based on Phase Recovery for DNN-Based Speech Synthesis
Shinji Takaki, Hirokazu Kameoka, Junichi Yamagishi
Direct Modelling of Magnitude and Phase Spectra for Statistical Parametric Speech Synthesis
Felipe Espic, Cassia Valentini Botinhao, Simon King
Disambiguate or not? — The Role of Prosody in Unambiguous and Potentially Ambiguous Anaphora Production in Strictly Mandarin Parallel Structures
Luying Hou, Bert Le Bruyn, René Kager