Papers
LibriTTS: A Corpus Derived from LibriSpeech for Text-to-Speech
Heiga Zen, Viet Dang, Rob Clark et al.
Linear Discriminant Differential Evolution for Feature Selection in Emotional Speech Recognition
Soumaya Gharsellaoui, Sid Ahmed Selouani, Mohammed Sidi Yakoub
Linguistically-Informed Training of Acoustic Word Embeddings for Low-Resource Languages
Zixiaofan Yang, Julia Hirschberg
Linguistically Motivated Parallel Data Augmentation for Code-Switch Language Modeling
Grandee Lee, Xianghu Yue, Haizhou Li
LipSound: Neural Mel-Spectrogram Reconstruction for Lip Reading
Leyuan Qu, Cornelius Weber, Stefan Wermter
Liquid Deletion in French Child-Directed Speech
Sharon Peperkamp, Monica Hegde, Maria Julia Carbajal
Listen, Attend, Spell and Adapt: Speaker Adapted Sequence-to-Sequence ASR
Felix Weninger, Jesús Andrés-Ferrer, Xinwei Li et al.
Listener Preference on the Local Criterion for Ideal Binary-Masked Speech
Zhuohuang Zhang, Yi Shen
Listeners’ Ability to Identify the Gender of Preadolescent Children in Different Linguistic Contexts
Shawn Nissen, Sharalee Blunck, Anita Dromey et al.
Listening with Great Expectations: An Investigation of Word Form Anticipations in Naturalistic Speech
M. Bentum, L. ten Bosch, A. van den Bosch et al.
Locality-Constrained Linear Coding Based Fused Visual Features for Robust Acoustic Event Classification
Manjunath Mulimani, Shashidhar G. Koolagudi
Lombard Speech Synthesis Using Transfer Learning in a Tacotron Text-to-Speech System
Bajibabu Bollepalli, Lauri Juvela, Paavo Alku
Long Range Acoustic Features for Spoofed Speech Detection
Rohan Kumar Das, Jichen Yang, Haizhou Li
Low-Dimensional Bottleneck Features for On-Device Continuous Speech Recognition
David B. Ramsay, Kevin Kilgour, Dominik Roblek et al.
Low Resource Automatic Intonation Classification Using Gated Recurrent Unit (GRU) Networks Pre-Trained with Synthesized Pitch Patterns
Atreyee Saha, Chiranjeevi Yarra, Prasanta Kumar Ghosh
LSTM Based Similarity Measurement with Spectral Clustering for Speaker Diarization
Qingjian Lin, Ruiqing Yin, Ming Li et al.
Lyrics Recognition from Singing Voice Focused on Correspondence Between Voice and Notes
Motoyuki Suzuki, Sho Tomita, Tomoki Morita
M2H-GAN: A GAN-Based Mapping from Machine to Human Transcripts for Speech Understanding
Titouan Parcollet, Mohamed Morchid, Xavier Bost et al.
Masking Estimation with Phase Restoration of Clean Speech for Monaural Speech Enhancement
Xianyun Wang, Changchun Bao
Maximum a posteriori Speech Enhancement Based on Double Spectrum
Pejman Mowlaee, Daniel Scheran, Johannes Stahl et al.
MCE 2018: The 1st Multi-Target Speaker Detection and Identification Challenge Evaluation
Suwon Shon, Najim Dehak, Douglas Reynolds et al.
Meeting Transcription Using Asynchronous Distant Microphones
Takuya Yoshioka, Dimitrios Dimitriadis, Andreas Stolcke et al.
Mel-Frequency Cepstral Coefficients of Voice Source Waveforms for Classification of Phonation Types in Speech
Sudarsana Reddy Kadiri, Paavo Alku
Meta Learning for Hyperparameter Optimization in Dialogue System
Jen-Tzung Chien, Wei Xiang Lieow
Mining Polysemous Triplets with Recurrent Neural Networks for Spoken Language Understanding
Vedran Vukotić, Christian Raymond