Papers
Pitch Characteristics of L2 English Speech by Chinese Speakers: A Large-scale Study
Jiahong Yuan, Qiusi Dong, Fei Wu et al.
Play Duration Based User-Entity Affinity Modeling in Spoken Dialog System
Bo Xiao, Nicholas Monath, Shankar Ananthakrishnan et al.
Postfiltering Using Log-Magnitude Spectrum for Speech and Audio Coding
Sneha Das, Tom Bäckström
Postfiltering with Complex Spectral Correlations for Speech and Audio Coding
Sneha Das, Tom Bäckström
Predicting Arousal and Valence from Waveforms and Spectrograms Using Deep Neural Networks
Zixiaofan Yang, Julia Hirschberg
Prediction of Aesthetic Elements in Karnatic Music: A Machine Learning Approach
Ragesh Rajan M, Ashwin Vijayakumar, Deepu Vijayasenan
Prediction of Perceived Speech Quality Using Deep Machine Listening
Jasper Ooster, Rainer Huber, Bernd T. Meyer
Prediction of Subjective Listening Effort from Acoustic Data with Non-Intrusive Deep Models
Paul Kranzusch, Rainer Huber, Melanie Krüger et al.
Prediction of Turn-taking Using Multitask Learning with Prediction of Backchannels and Fillers
Kohei Hara, Koji Inoue, Katsuya Takanashi et al.
Preference-Learning with Qualitative Agreement for Sentence Level Emotional Annotations
Srinivas Parthasarathy, Carlos Busso
Processing Transition Regions of Glottal Stop Substituted /S/ for Intelligibility Enhancement of Cleft Palate Speech
Protima Nomo Sudro, Sishir Kalita, S R Mahadeva Prasanna
Prominence-based Evaluation of L2 Prosody
Heini Kallio, Antti Suni, Päivi Virkkunen et al.
Prosodic Focus Acquisition in French Early Cochlear Implanted Children
Chadi Farah, Stephane Roman, Mariapaola D'Imperio
Punctuation Prediction Model for Conversational Speech
Piotr Żelasko, Piotr Szymański, Jan Mizgajski et al.
Quality-Net: An End-to-End Non-intrusive Speech Quality Assessment Model Based on BLSTM
Szu-wei Fu, Yu Tsao, Hsin-Te Hwang et al.
Quaternion Convolutional Neural Networks for End-to-End Automatic Speech Recognition
Titouan Parcollet, Ying Zhang, Mohamed Morchid et al.
Rapid Collection of Spontaneous Speech Corpora Using Telephonic Community Forums
Agha Ali Raza, Awais Athar, Shan Randhawa et al.
Rapid Style Adaptation Using Residual Error Embedding for Expressive Speech Synthesis
Xixin Wu, Yuewen Cao, Mu Wang et al.
R-CRNN: Region-based Convolutional Recurrent Neural Network for Audio Event Detection
Chieh-Chi Kao, Weiran Wang, Ming Sun et al.
Recognition of Echolalic Autistic Child Vocalisations Utilising Convolutional Recurrent Neural Networks
Shahin Amiriparian, Alice Baird, Sahib Julka et al.
Recognizing Overlapped Speech in Meetings: A Multichannel Separation Approach Using Neural Networks
Takuya Yoshioka, Hakan Erdogan, Zhuo Chen et al.