Papers
Combining Speaker Turn Embedding and Incremental Structure Prediction for Low-Latency Speaker Diarization
Guillaume Wisniewksi, Hervé Bredin, G. Gelly et al.
Comparing Human and Machine Errors in Conversational Speech Transcription
Andreas Stolcke, Jasha Droppo
Comparing Languages Using Hierarchical Prosodic Analysis
Juraj Šimko, Antti Suni, Katri Hiovain et al.
Comparison of Basic Beatboxing Articulations Between Expert and Novice Artists Using Real-Time Magnetic Resonance Imaging
Nimisha Patil, Timothy Greer, Reed Blaylock et al.
Comparison of Decoding Strategies for CTC Acoustic Models
Thomas Zenkel, Ramon Sanabria, Florian Metze et al.
Comparison of Modeling Target in LSTM-RNN Duration Model
Bo Chen, Jiahao Lai, Kai Yu
Comparison of Non-Parametric Bayesian Mixture Models for Syllable Clustering and Zero-Resource Speech Processing
Shreyas Seshadri, Ulpu Remes, Okko Räsänen
Compensating Gender Variability in Query-by-Example Search on Speech Using Voice Conversion
Paula Lopez-Otero, Laura Docio-Fernandez, Carmen Garcia-Mateo
Complexity in Speech and its Relation to Emotional Bond in Therapist-Patient Interactions During Suicide Risk Assessment Interviews
Md. Nasir, Brian Baucom, Craig J. Bryan et al.
Complex-Valued Restricted Boltzmann Machine for Direct Learning of Frequency Spectra
Toru Nakashika, Shinji Takaki, Junichi Yamagishi
Compressed Time Delay Neural Network for Small-Footprint Keyword Spotting
Ming Sun, David Snyder, Yixin Gao et al.
Computational Analysis of Acoustic Descriptors in Psychotic Patients
Torsten Wörtwein, Tadas Baltrušaitis, Eugene Laksana et al.
Computational Simulations of Temporal Vocalization Behavior in Adult-Child Interaction
Ellen Marklund, David Pagmar, Tove Gerholm et al.
Computing Multimodal Dyadic Behaviors During Spontaneous Diagnosis Interviews Toward Automatic Categorization of Autism Spectrum Disorder
Chin-Po Chen, Xian-Hong Tseng, Susan Shur-Fen Gau et al.
Concatenative Resynthesis Using Twin Networks
Soumi Maiti, Michael I. Mandel
Conditional Generative Adversarial Nets Classifier for Spoken Language Identification
Peng Shen, Xugang Lu, Sheng Li et al.
Conditional Generative Adversarial Networks for Speech Enhancement and Noise-Robust Speaker Verification
Daniel Michelsanti, Zheng-Hua Tan
Constructing Acoustic Distances Between Subwords and States Obtained from a Deep Neural Network for Spoken Term Detection
Daisuke Kaneko, Ryota Konno, Kazunori Kojima et al.
Content Normalization for Text-Dependent Speaker Verification
Subhadeep Dey, Srikanth Madikeri, Petr Motlicek et al.
Context Regularity Indexed by Auditory N1 and P2 Event-Related Potentials
Xiao Wang, Yanhui Zhang, Gang Peng
Controlling Prominence Realisation in Parametric DNN-Based Speech Synthesis
Zofia Malisz, Harald Berthelsen, Jonas Beskow et al.
Conversing with Social Agents That Smile and Laugh
Catherine Pelachaud
Convolutional Neural Network to Model Articulation Impairments in Patients with Parkinson’s Disease
J.C. Vásquez-Correa, Juan Rafael Orozco-Arroyave, Elmar Nöth
Convolutional Recurrent Neural Networks for Small-Footprint Keyword Spotting
Sercan Ö. Arık, Markus Kliegl, Rewon Child et al.
Co-Production of Speech and Pointing Gestures in Clear and Perturbed Interactive Tasks: Multimodal Designation Strategies
Marion Dohen, Benjamin Roustan