Papers
Combining Adversarial Training and Disentangled Speech Representation for Robust Zero-Resource Subword Modeling
Siyuan Feng, Tan Lee, Zhiyuan Peng
Combining Speaker Recognition and Metric Learning for Speaker-Dependent Representation Learning
João Monteiro, Jahangir Alam, Tiago H. Falk
Comparative Analysis of Prosodic Characteristics Using WaveNet Embeddings
Antti Suni, Marcin Włodarczak, Martti Vainio et al.
Comparative Analysis of Think-Aloud Methods for Everyday Activities in the Context of Cognitive Robotics
Moritz Meier, Celeste Mason, Felix Putze et al.
Comparative Study of Parametric and Representation Uncertainty Modeling for Recurrent Neural Network Language Models
Jianwei Yu, Max W.Y. Lam, Shoukang Hu et al.
Comparison of Lattice-Free and Lattice-Based Sequence Discriminative Training Criteria for LVCSR
Wilfried Michel, Ralf Schlüter, Hermann Ney
Comparison of Speech Tasks and Recording Devices for Voice Based Automatic Classification of Healthy Subjects and Patients with Amyotrophic Lateral Sclerosis
Suhas B.N., Deep Patel, Nithin Rao et al.
Comparison of Telephone Recordings and Professional Microphone Recordings for Early Detection of Parkinson’s Disease, Using Mel-Frequency Cepstral Coefficients with Gaussian Mixture Models
Laetitia Jeancolas, Graziella Mangone, Jean-Christophe Corvol et al.
Compensation for French Liquid Deletion During Auditory Sentence Processing
Sharon Peperkamp, Alvaro Martin Iturralde Zurita
Completely Unsupervised Phoneme Recognition by a Generative Adversarial Network Harmonized with Iteratively Refined Hidden Markov Models
Kuan-Yu Chen, Che-Ping Tsai, Da-Rong Liu et al.
Compression of Acoustic Event Detection Models with Quantized Distillation
Bowen Shi, Ming Sun, Chieh-Chi Kao et al.
Compression of CTC-Trained Acoustic Models by Dynamic Frame-Wise Distillation or Segment-Wise N-Best Hypotheses Imitation
Haisong Ding, Kai Chen, Qiang Huo
“Computer, Test My Hearing”: Accurate Speech Audiometry with Smart Speakers
Jasper Ooster, Pia Nancy Porysek Moreta, Jörg-Hendrik Bach et al.
Conditional Variational Auto-Encoder for Text-Driven Expressive AudioVisual Speech Synthesis
Sara Dahmani, Vincent Colotte, Valérian Girard et al.
Connecting and Comparing Language Model Interpolation Techniques
Ernest Pusateri, Christophe Van Gysel, Rami Botros et al.
Consonant Classification in Mandarin Based on the Depth Image Feature: A Pilot Study
Han-Chi Hsieh, Wei-Zhong Zheng, Ko-Chiang Chen et al.
Constrained Output Embeddings for End-to-End Code-Switching Speech Recognition with Only Monolingual Data
Yerbolat Khassanov, Haihua Xu, Van Tung Pham et al.
Contextual Recovery of Out-of-Lattice Named Entities in Automatic Speech Recognition
Jack Serrino, Leonid Velikovich, Petar Aleksic et al.
Continuous Emotion Recognition in Speech — Do We Need Recurrence?
Maximilian Schmitt, Nicholas Cummins, Björn W. Schuller
Conversational and Social Laughter Synthesis with WaveNet
Hiroki Mori, Tomohiro Nagata, Yoshiko Arimoto
Conversational Emotion Analysis via Attention Mechanisms
Zheng Lian, Jianhua Tao, Bin Liu et al.
Convolutional Neural Network-Based Speech Enhancement for Cochlear Implant Recipients
Nursadul Mamun, Soheil Khorram, John H.L. Hansen
Corpus Design Using Convolutional Auto-Encoder Embeddings for Audio-Book Synthesis
Meysam Shamsi, Damien Lolive, Nelly Barbot et al.
CRIM’s Speech Transcription and Call Sign Detection System for the ATC Airbus Challenge Task
Vishwa Gupta, Lise Rebout, Gilles Boulianne et al.