Papers
Personalized, Cross-Lingual TTS Using Phonetic Posteriorgrams
Lifa Sun, Hao Wang, Shiyin Kang et al.
Personalized Natural Language Understanding
Xiaohu Liu, Ruhi Sarikaya, Liang Zhao et al.
Phase-Aware Signal Processing for Automatic Speech Recognition
Johannes Fahringer, Tobias Schrank, Johannes Stahl et al.
Phase-Encoded Speech Spectrograms
Chandra Sekhar Seelamantula
Phoneme Embedding and its Application to Speech Driven Talking Avatar Synthesis
Xu Li, Zhiyong Wu, Helen Meng et al.
Phoneme, Phone Boundary, and Tone in Automatic Scoring of Mandarin Proficiency
Jiahong Yuan, Mark Liberman
Phoneme Set Design Considering Integrated Acoustic and Linguistic Features of Second Language Speech
Xiaoyun Wang, Tsuneo Kato, Seiichi Yamamoto
Phone Synchronous Decoding with CTC Lattice
Zhehuai Chen, Wei Deng, Tao Xu et al.
Phonetic and Phonological Posterior Search Space Hashing Exploiting Class-Specific Sparsity Structures
Afsaneh Asaei, Gil Luyet, Milos Cernak et al.
Phonetic Context Embeddings for DNN-HMM Phone Recognition
Leonardo Badino
Phonetic Reduction Can Lead to Lengthening, and Enhancement Can Lead to Shortening
Clara Cohen, Matt Carlson
Phonotactic Language Identification for Singing
Anna M. Kruspe
PhonVoc: A Phonetic and Phonological Vocoding Toolkit
Milos Cernak, Philip N. Garner
Pitch-Adaptive Front-End Features for Robust Children’s ASR
S. Shahnawazuddin, Abhishek Dey, Rohit Sinha
Pitch-Range Perception: The Dynamic Interaction Between Voice Quality and Fundamental Frequency
Jianjing Kuang, Mark Liberman
Predicting Affective Dimensions Based on Self Assessed Depression Severity
Rahul Gupta, Shrikanth S. Narayanan
Predicting Binaural Speech Intelligibility from Signals Estimated by a Blind Source Separation Algorithm
Qingju Liu, Yan Tang, Philip J.B. Jackson et al.
Predicting Pronunciations with Syllabification and Stress with Recurrent Neural Networks
Daan van Esch, Mason Chua, Kanishka Rao
Predicting Severity of Voice Disorder from DNN-HMM Acoustic Posteriors
Tan Lee, Yuanyuan Liu, Yu Ting Yeung et al.
Predicting User Satisfaction from Turn-Taking in Spoken Conversations
Shammur Absar Chowdhury, Evgeny A. Stepanov, Giuseppe Riccardi
Prediction and Generation of Backchannel Form for Attentive Listening Systems
Tatsuya Kawahara, Takashi Yamaguchi, Koji Inoue et al.
Prediction of the Articulatory Movements of Unseen Phonemes of a Speaker Using the Speech Structure of Another Speaker
Hidetsugu Uchida, Daisuke Saito, Nobuaki Minematsu
Preliminary Experiments on Unsupervised Word Discovery in Mboshi
Pierre Godard, Gilles Adda, Martine Adda-Decker et al.
Priors for Speaker Counting and Diarization with AHC
Gregory Sell, Alan McCree, Daniel Garcia-Romero