Papers
Don’t Count on ASR to Transcribe for You: Breaking Bias with Two Crowds
Michael Levit, Yan Huang, Shuangyu Chang et al.
Duration Mismatch Compensation Using Four-Covariance Model and Deep Neural Network for Speaker Verification
Pierre-Michel Bousquet, Mickael Rouvier
Dynamic Layer Normalization for Adaptive Neural Acoustic Modeling in Speech Recognition
Taesup Kim, Inchul Song, Yoshua Bengio
Dysprosody Differentiate Between Parkinson’s Disease, Progressive Supranuclear Palsy, and Multiple System Atrophy
Jan Hlavnička, Tereza Tykalová, Roman Čmejla et al.
Earlier Identification of Children with Autism Spectrum Disorder: An Automatic Vocalisation-Based Approach
Florian B. Pokorny, Björn Schuller, Peter B. Marschik et al.
Effectively Building Tera Scale MaxEnt Language Models Incorporating Non-Linguistic Signals
Fadi Biadsy, Mohammadreza Ghodsi, Diamantino Caseiro
Effect of Formant and F0 Discontinuity on Perceived Vowel Duration: Impacts for Concatenative Speech Synthesis
Tomáš Bořil, Pavel Šturm, Radek Skarnitzl et al.
Effect of Language, Speaking Style and Speaker on Long-Term F0 Estimation
Pablo Arantes, Anders Eriksson, Suska Gutzeit
Effects of Talker Dialect, Gender & Race on Accuracy of Bing Speech and YouTube Automatic Captions
Rachael Tatman, Conner Kasten
Effects of Training Data Variety in Generating Glottal Pulses from Acoustic Features with DNNs
Manu Airaksinen, Paavo Alku
Efficient Emotion Recognition from Speech Using Deep Learning on Spectrograms
Aharon Satt, Shai Rozenberg, Ron Hoory
Efficient Knowledge Distillation from an Ensemble of Teachers
Takashi Fukuda, Masayuki Suzuki, Gakuto Kurata et al.
Eigenvector-Based Speech Mask Estimation Using Logistic Regression
Lukas Pfeifenberger, Matthias Zöhrer, Franz Pernkopf
Electrophysiological Correlates of Familiar Voice Recognition
Julien Plante-Hébert, Victor J. Boucher, Boutheina Jemel
Elicitation Design for Acoustic Depression Classification: An Investigation of Articulation Effort, Linguistic Complexity, and Word Affect
Brian Stasak, Julien Epps, Roland Goecke
Eliciting Meaningful Units from Speech
Daniil Kocharov, Tatiana Kachkovskaia, Pavel Skrelin
Embedding-Based Speaker Adaptive Training of Deep Neural Networks
Xiaodong Cui, Vaibhava Goel, George Saon
Emojive! Collecting Emotion Data from Speech and Facial Expression Using Mobile Game App
Ji Ho Park, Nayeon Lee, Dario Bertero et al.
Emotional Features for Speech Overlaps Classification
Olga Egorow, Andreas Wendemuth
Emotional Speech of Mentally and Physically Disabled Individuals: Introducing the EmotAsS Database and First Findings
Simone Hantke, Hesam Sagha, Nicholas Cummins et al.
Emotional Thin-Slicing: A Proposal for a Short- and Long-Term Division of Emotional Speech
Daniel Oliveira Peres, Dominic Watt, Waldemar Ferreira Netto
Emotional Voice Conversion with Adaptive Scales F0 Based on Wavelet Transform Using Limited Amount of Emotional Data
Zhaojie Luo, Jinhui Chen, Tetsuya Takiguchi et al.
Emotion Category Mapping to Emotional Space by Cross-Corpus Emotion Labeling
Yoshiko Arimoto, Hiroki Mori
Empirical Evaluation of Parallel Training Algorithms on Acoustic Modeling
Wenpeng Li, Binbin Zhang, Lei Xie et al.