Papers
Large-Scale Visual Speech Recognition
Brendan Shillingford, Yannis Assael, Matthew W. Hoffman et al.
Latent Dirichlet Allocation Based Acoustic Data Selection for Automatic Speech Recognition
Mortaza Doulaty, Thomas Hain
Latent Topic Attention for Domain Classification
Peisong Huang, Peijie Huang, Wencheng Ai et al.
Lattice-Based Lightly-Supervised Acoustic Model Training
Joachim Fainberg, Ondřej Klejch, Steve Renals et al.
Lattice Generation in Attention-Based Speech Recognition Models
Michał Zapotoczny, Piotr Pietrzak, Adrian Łańcucki et al.
Lattice Re-Scoring During Manual Editing for Automatic Error Correction of ASR Transcripts
Anna V. Rúnarsdóttir, Inga R. Helgadóttir, Jón Guðnason
Laughter Dynamics in Dyadic Conversations
Bogdan Ludusan, Petra Wagner
Layer Trajectory BLSTM
Eric Sun, Jinyu Li, Yifan Gong
LEAP Diarization System for the Second DIHARD Challenge
Prachi Singh, Harsha Vardhan M.A., Sriram Ganapathy et al.
Learning Alignment for Multimodal Emotion Recognition from Speech
Haiyang Xu, Hui Zhang, Kun Han et al.
Learning How to Listen: A Temporal-Frequential Attention Model for Sound Event Detection
Yu-Han Shen, Ke-Xin He, Wei-Qiang Zhang
Learning Natural Language Interfaces with Neural Models
Mirella Lapata
Learning Problem-Agnostic Speech Representations from Multiple Self-Supervised Tasks
Santiago Pascual, Mirco Ravanelli, Joan Serrà et al.
Learning Speaker Aware Offsets for Speaker Adaptation of Neural Networks
Leda Sarı, Samuel Thomas, Mark A. Hasegawa-Johnson
Learning Speaker Representations with Mutual Information
Mirco Ravanelli, Yoshua Bengio
Learning Temporal Clusters Using Capsule Routing for Speech Emotion Recognition
Md. Asif Jalal, Erfan Loweimi, Roger K. Moore et al.
Learning to Speak Fluently in a Foreign Language: Multilingual Speech Synthesis and Cross-Language Voice Cloning
Yu Zhang, Ron J. Weiss, Heiga Zen et al.
Learn Spelling from Teachers: Transferring Knowledge from Language Models to Sequence-to-Sequence Speech Recognition
Ye Bai, Jiangyan Yi, Jianhua Tao et al.
Leveraging a Character, Word and Prosody Triplet for an ASR Error Robust and Agglutination Friendly Punctuation Approach
György Szaszák, Máté Ákos Tündik
Leveraging Acoustic Cues and Paralinguistic Embeddings to Detect Expression from Voice
Vikramjit Mitra, Sue Booker, Erik Marchi et al.
Lexically Guided Perceptual Learning of a Vowel Shift in an Interactive L2 Listening Context
E. Felker, Mirjam Ernestus, Mirjam Broersma
LF-MMI Training of Bayesian and Gaussian Process Time Delay Neural Networks for Speech Recognition
Shoukang Hu, Xurong Xie, Shansong Liu et al.
LibriTTS: A Corpus Derived from LibriSpeech for Text-to-Speech
Heiga Zen, Viet Dang, Rob Clark et al.
Linear Discriminant Differential Evolution for Feature Selection in Emotional Speech Recognition
Soumaya Gharsellaoui, Sid Ahmed Selouani, Mohammed Sidi Yakoub
Linguistically-Informed Training of Acoustic Word Embeddings for Low-Resource Languages
Zixiaofan Yang, Julia Hirschberg