Papers
Effects of Talker Dialect, Gender & Race on Accuracy of Bing Speech and YouTube Automatic Captions
Rachael Tatman, Conner Kasten
Effects of Training Data Variety in Generating Glottal Pulses from Acoustic Features with DNNs
Manu Airaksinen, Paavo Alku
Efficient Emotion Recognition from Speech Using Deep Learning on Spectrograms
Aharon Satt, Shai Rozenberg, Ron Hoory
Efficient Knowledge Distillation from an Ensemble of Teachers
Takashi Fukuda, Masayuki Suzuki, Gakuto Kurata et al.
Eigenvector-Based Speech Mask Estimation Using Logistic Regression
Lukas Pfeifenberger, Matthias Zöhrer, Franz Pernkopf
Electrophysiological Correlates of Familiar Voice Recognition
Julien Plante-Hébert, Victor J. Boucher, Boutheina Jemel
Elicitation Design for Acoustic Depression Classification: An Investigation of Articulation Effort, Linguistic Complexity, and Word Affect
Brian Stasak, Julien Epps, Roland Goecke
Eliciting Meaningful Units from Speech
Daniil Kocharov, Tatiana Kachkovskaia, Pavel Skrelin
Embedding-Based Speaker Adaptive Training of Deep Neural Networks
Xiaodong Cui, Vaibhava Goel, George Saon
Emojive! Collecting Emotion Data from Speech and Facial Expression Using Mobile Game App
Ji Ho Park, Nayeon Lee, Dario Bertero et al.
Emotional Features for Speech Overlaps Classification
Olga Egorow, Andreas Wendemuth
Emotional Speech of Mentally and Physically Disabled Individuals: Introducing the EmotAsS Database and First Findings
Simone Hantke, Hesam Sagha, Nicholas Cummins et al.
Emotional Thin-Slicing: A Proposal for a Short- and Long-Term Division of Emotional Speech
Daniel Oliveira Peres, Dominic Watt, Waldemar Ferreira Netto
Emotional Voice Conversion with Adaptive Scales F0 Based on Wavelet Transform Using Limited Amount of Emotional Data
Zhaojie Luo, Jinhui Chen, Tetsuya Takiguchi et al.
Emotion Category Mapping to Emotional Space by Cross-Corpus Emotion Labeling
Yoshiko Arimoto, Hiroki Mori
Empirical Evaluation of Parallel Training Algorithms on Acoustic Modeling
Wenpeng Li, Binbin Zhang, Lei Xie et al.
Empirical Exploration of Novel Architectures and Objectives for Language Models
Gakuto Kurata, Abhinav Sethy, Bhuvana Ramabhadran et al.
End-of-Utterance Prediction by Prosodic Features and Phrase-Dependency Structure in Spontaneous Japanese Speech
Yuichi Ishimoto, Takehiro Teraoka, Mika Enomoto
Endpoint Detection Using Grid Long Short-Term Memory Networks for Streaming Speech Recognition
Shuo-Yiin Chang, Bo Li, Tara N. Sainath et al.
End-to-End Acoustic Feedback in Language Learning for Correcting Devoiced French Final-Fricatives
Sucheta Ghosh, Camille Fauth, Yves Laprie et al.
End-to-End Deep Learning Framework for Speech Paralinguistics Detection Based on Perception Aware Spectrum
Danwei Cai, Zhidong Ni, Wenbo Liu et al.
End-to-End Language Identification Using High-Order Utterance Representation with Bilinear Pooling
Ma Jin, Yan Song, Ian McLoughlin et al.
End-to-End Text-Independent Speaker Verification with Triplet Loss on Short Utterances
Chunlei Zhang, Kazuhito Koishida
End-to-End Training of Acoustic Models for Large Vocabulary Continuous Speech Recognition with TensorFlow
Ehsan Variani, Tom Bagby, Erik McDermott et al.