Papers
Improved Time-Frequency Trajectory Excitation Vocoder for DNN-Based Speech Synthesis
Eunwoo Song, Frank K. Soong, Hong-Goo Kang
Improving Automatic Recognition of Aphasic Speech with AphasiaBank
Duc Le, Emily Mower Provost
Improving Boundary Estimation in Audiovisual Speech Activity Detection Using Bayesian Information Criterion
Fei Tao, John H.L. Hansen, Carlos Busso
Improving Children’s Speech Recognition Through Out-of-Domain Data Augmentation
Joachim Fainberg, Peter Bell, Mike Lincoln et al.
Improving Deep Neural Networks Based Speaker Verification Using Unlabeled Data
Yao Tian, Meng Cai, Liang He et al.
Improving English Conversational Telephone Speech Recognition
Ivan Medennikov, Alexey Prudnikov, Alexander Zatvornitskiy
Improving Generalisation to New Speakers in Spoken Dialogue State Tracking
Iñigo Casanueva, Thomas Hain, Phil Green
Improving i-Vector and PLDA Based Speaker Clustering with Long-Term Features
Abraham Woubie, Jordi Luque, Javier Hernando
Improving Large Vocabulary Accented Mandarin Speech Recognition with Attribute-Based I-Vectors
Hao Zheng, Shanshan Zhang, Liwei Qiao et al.
Improving Prosodic Boundaries Prediction for Mandarin Speech Synthesis by Using Enhanced Embedding Feature and Model Fusion Approach
Yibin Zheng, Ya Li, Zhengqi Wen et al.
Improving the Lwazi ASR Baseline
Charl van Heerden, Neil Kleynhans, Marelie Davel
Improving the Probabilistic Framework for Representing Dialogue Systems with User Response Model
Miao Li, Zhipeng Chen, Ji Wu
Improving TTS with Corpus-Specific Pronunciation Adaptation
Marie Tahon, Raheel Qader, Gwénolé Lecorvé et al.
Improving Under-Resourced Language ASR Through Latent Subword Unit Space Discovery
Marzieh Razavi, Mathew Magimai-Doss
Incorporating a Generative Front-End Layer to Deep Neural Network for Noise Robust Automatic Speech Recognition
Souvik Kundu, Khe Chai Sim, Mark J.F. Gales
Individual Identity in Songbirds: Signal Representations and Metric Learning for Locating the Information in Complex Corvid Calls
Dan Stowell, Veronica Morfi, Lisa F. Gill
Inferring Phonemic Classes from CNN Activation Maps Using Clustering Techniques
Thomas Pellegrini, Sandrine Mouysset
Integrated Spoofing Countermeasures and Automatic Speaker Verification: An Evaluation on ASVspoof 2015
Md. Sahidullah, Héctor Delgado, Massimiliano Todisco et al.
Intelligibility of Disordered Speech: Global and Detailed Scores
Mario Ganzeboom, Marjoke Bakker, Catia Cucchiarini et al.
Interaction Between Lexical Tone and Intonation: An EMA Study
Hao Yi, Sam Tilsen
Interactive Spoken Content Retrieval by Deep Reinforcement Learning
Yen-Chen Wu, Tzu-Hsiang Lin, Yang-De Chen et al.
Interpretation of Low Dimensional Neural Network Bottleneck Features in Terms of Human Perception and Production
Philip Weber, Linxue Bai, Martin Russell et al.
Inter-Speech Clicks in an Interspeech Keynote
Jürgen Trouvain, Zofia Malisz
Inter-Task System Fusion for Speaker Recognition
M. Ferras, Srikanth Madikeri, S. Dey et al.