Papers
Large-Scale Multilingual Speech Recognition with a Streaming End-to-End Model
Anjuli Kannan, Arindrima Datta, Tara N. Sainath et al.
Large-Scale Speaker Diarization of Radio Broadcast Archives
Emre Yılmaz, Adem Derinel, Kun Zhou et al.
Large-Scale Speaker Retrieval on Random Speaker Variability Subspace
Suwon Shon, Younggun Lee, Taesu Kim
Large-Scale Visual Speech Recognition
Brendan Shillingford, Yannis Assael, Matthew W. Hoffman et al.
Latent Dirichlet Allocation Based Acoustic Data Selection for Automatic Speech Recognition
Mortaza Doulaty, Thomas Hain
Latent Topic Attention for Domain Classification
Peisong Huang, Peijie Huang, Wencheng Ai et al.
Lattice-Based Lightly-Supervised Acoustic Model Training
Joachim Fainberg, Ondřej Klejch, Steve Renals et al.
Lattice Generation in Attention-Based Speech Recognition Models
Michał Zapotoczny, Piotr Pietrzak, Adrian Łańcucki et al.
Lattice Re-Scoring During Manual Editing for Automatic Error Correction of ASR Transcripts
Anna V. Rúnarsdóttir, Inga R. Helgadóttir, Jón Guðnason
Laughter Dynamics in Dyadic Conversations
Bogdan Ludusan, Petra Wagner
Layer Trajectory BLSTM
Eric Sun, Jinyu Li, Yifan Gong
LEAP Diarization System for the Second DIHARD Challenge
Prachi Singh, Harsha Vardhan M.A., Sriram Ganapathy et al.
Learning Alignment for Multimodal Emotion Recognition from Speech
Haiyang Xu, Hui Zhang, Kun Han et al.
Learning How to Listen: A Temporal-Frequential Attention Model for Sound Event Detection
Yu-Han Shen, Ke-Xin He, Wei-Qiang Zhang
Learning Natural Language Interfaces with Neural Models
Mirella Lapata
Learning Problem-Agnostic Speech Representations from Multiple Self-Supervised Tasks
Santiago Pascual, Mirco Ravanelli, Joan Serrà et al.
Learning Speaker Aware Offsets for Speaker Adaptation of Neural Networks
Leda Sarı, Samuel Thomas, Mark A. Hasegawa-Johnson
Learning Speaker Representations with Mutual Information
Mirco Ravanelli, Yoshua Bengio
Learning Temporal Clusters Using Capsule Routing for Speech Emotion Recognition
Md. Asif Jalal, Erfan Loweimi, Roger K. Moore et al.
Learning to Speak Fluently in a Foreign Language: Multilingual Speech Synthesis and Cross-Language Voice Cloning
Yu Zhang, Ron J. Weiss, Heiga Zen et al.
Learn Spelling from Teachers: Transferring Knowledge from Language Models to Sequence-to-Sequence Speech Recognition
Ye Bai, Jiangyan Yi, Jianhua Tao et al.
Leveraging a Character, Word and Prosody Triplet for an ASR Error Robust and Agglutination Friendly Punctuation Approach
György Szaszák, Máté Ákos Tündik
Leveraging Acoustic Cues and Paralinguistic Embeddings to Detect Expression from Voice
Vikramjit Mitra, Sue Booker, Erik Marchi et al.
Lexically Guided Perceptual Learning of a Vowel Shift in an Interactive L2 Listening Context
E. Felker, Mirjam Ernestus, Mirjam Broersma
LF-MMI Training of Bayesian and Gaussian Process Time Delay Neural Networks for Speech Recognition
Shoukang Hu, Xurong Xie, Shansong Liu et al.