Papers
L2-ARCTIC: A Non-native English Speech Corpus
Guanlong Zhao, Sinem Sonsaat, Alif Silpachai et al.
Ladder Networks for Emotion Recognition: Using Unsupervised Auxiliary Tasks to Improve Predictions of Emotional Attributes
Srinivas Parthasarathy, Carlos Busso
Language-Dependent Melody Embeddings
Daniil Kocharov, Alla Menshikova
Language Features for Automated Evaluation of Cognitive Behavior Psychotherapy Sessions
Nikolaos Flemotomos, Victor Martinez, James Gibson et al.
Large Vocabulary Concatenative Resynthesis
Soumi Maiti, Joey Ching, Michael Mandel
Latent Factor Analysis of Deep Bottleneck Features for Speaker Verification with Random Digit Strings
Ziqiang Shi, Huibin Lin, Liu Liu et al.
Lattice-free State-level Minimum Bayes Risk Training of Acoustic Models
Naoyuki Kanda, Yusuke Fujita, Kenji Nagamatsu
Layer Trajectory LSTM
Jinyu Li, Changliang Liu, Yifan Gong
Learning Acoustic Word Embeddings with Temporal Context for Query-by-Example Speech Search
Yougen Yuan, Cheung-Chi Leung, Lei Xie et al.
Learning and Modeling Unit Embeddings for Improving HMM-based Unit Selection Speech Synthesis
Xiao Zhou, Zhen-Hua Ling, Zhi-Ping Zhou et al.
Learning Conditional Acoustic Latent Representation with Gender and Age Attributes for Automatic Pain Level Recognition
Jeng-Lin Li, Yi-Ming Weng, Chip-Jin Ng et al.
Learning Discriminative Features for Speaker Identification and Verification
Sarthak Yadav, Atul Rai
Learning Interpretable Control Dimensions for Speech Synthesis by Using External Data
Zack Hodari, Oliver Watts, Srikanth Ronanki et al.
Learning Spontaneity to Improve Emotion Recognition in Speech
Karttikeya Mangalam, Tanaya Guha
Learning Structured Dictionaries for Exemplar-based Voice Conversion
Shaojin Ding, Christopher Liberatore, Ricardo Gutierrez-Osuna
Learning to Adapt: A Meta-learning Approach for Speaker Adaptation
Ondřej Klejch, Joachim Fainberg, Peter Bell
Learning Two Tone Languages Enhances the Brainstem Encoding of Lexical Tones
Akshay Raj Maggu, Wenqing Zong, Vina Law et al.
Learning Word Embeddings: Unsupervised Methods for Fixed-size Representations of Variable-length Speech Segments
Nils Holzenberger, Mingxing Du, Julien Karadayi et al.
Length Contrast and Covarying Features: Whistled Speech as a Case Study
Rachid Ridouane, Giuseppina Turco, Julien Meyer
Leveraging Native Language Information for Improved Accented Speech Recognition
Shahram Ghorbani, John H.L. Hansen
Leveraging Second-Order Log-Linear Model for Improved Deep Learning Based ASR Performance
Ankit Raj, Shakti P Rath, Jithendra Vepa
Leveraging Translations for Speech Transcription in Low-resource Settings
Antonios Anastasopoulos, David Chiang
Lexical and Acoustic Deep Learning Model for Personality Recognition
Guozhen An, Rivka Levitan
Lightly Supervised vs. Semi-supervised Training of Acoustic Model on Luxembourgish for Low-resource Automatic Speech Recognition
Karel Veselý, Carlos Segura, Igor Szöke et al.
Linear Prediction Residual based Short-term Cepstral Features for Replay Attacks Detection
Madhusudan Singh, Debadatta Pati