Papers
Lightweight Online Noise Reduction on Embedded Devices Using Hierarchical Recurrent Neural Networks
H. Schröter, T. Rosenkranz, A.N. Escalante-B. et al.
Links Between Production and Perception of Glottalisation in Individual Australian English Speaker/Listeners
Joshua Penney, Felicity Cox, Anita Szakay
Lip Graph Assisted Audio-Visual Speech Recognition Using Bidirectional Synchronous Fusion
Hong Liu, Zhan Chen, Bing Yang
Listen Attentively, and Spell Once: Whole Sentence Generation via a Non-Autoregressive Architecture for Low-Latency Speech Recognition
Ye Bai, Jiangyan Yi, Jianhua Tao et al.
Listen to What You Want: Neural Network-Based Universal Sound Selector
Tsubasa Ochiai, Marc Delcroix, Yuma Koizumi et al.
Lite Audio-Visual Speech Enhancement
Shang-Yi Chuang, Yu Tsao, Chen-Chou Lo et al.
Low Latency Auditory Attention Detection with Common Spatial Pattern Analysis of EEG Signals
Siqi Cai, Enze Su, Yonghao Song et al.
Low Latency End-to-End Streaming Speech Recognition with a Scout Network
Chengyi Wang, Yu Wu, Liang Lu et al.
Low-Latency Sequence-to-Sequence Speech Recognition and Translation by Partial Hypothesis Selection
Danni Liu, Gerasimos Spanakis, Jan Niehues
Low-Latency Single Channel Speech Dereverberation Using U-Net Convolutional Neural Networks
Ahmet E. Bulut, Kazuhito Koishida
Low Latency Speech Recognition Using End-to-End Prefetching
Shuo-Yiin Chang, Bo Li, David Rybach et al.
LVCSR with Transformer Language Models
Eugen Beck, Ralf Schlüter, Hermann Ney
Making a Distinction Between Schizophrenia and Bipolar Disorder Based on Temporal Parameters in Spontaneous Speech
Gábor Gosztolya, Anita Bagi, Szilvia Szalóki et al.
Malayalam-English Code-Switched: Grapheme to Phoneme System
Sreeja Manghat, Sreeram Manghat, Tanja Schultz
Mandarin and English Adults’ Cue-Weighting of Lexical Stress
Zhen Zeng, Karen Mattock, Liquan Liu et al.
Mandarin Lexical Tones: A Corpus-Based Study of Word Length, Syllable Position and Prosodic Position on Duration
Yaru Wu, Martine Adda-Decker, Lori Lamel
Mask CTC: Non-Autoregressive End-to-End ASR with CTC and Mask Predict
Yosuke Higuchi, Shinji Watanabe, Nanxin Chen et al.
Massively Multilingual ASR: 50 Languages, 1 Model, 1 Billion Parameters
Vineel Pratap, Anuroop Sriram, Paden Tomasello et al.
MatchboxNet: 1D Time-Channel Separable Convolutional Neural Network Architecture for Speech Commands Recognition
Somshubra Majumdar, Boris Ginsburg
Memory Controlled Sequential Self Attention for Sound Recognition
Arjun Pankajakshan, Helen L. Bear, Vinod Subramanian et al.
Mentoring-Reverse Mentoring for Unsupervised Multi-Channel Speech Source Separation
Yu Nakagome, Masahito Togami, Tetsuji Ogawa et al.
Metadata-Aware End-to-End Keyword Spotting
Hongyi Liu, Apurva Abhyankar, Yuriy Mishchenko et al.
Meta-Learning for Short Utterance Speaker Recognition with Imbalance Length Pairs
Seong Min Kye, Youngmoon Jung, Hae Beom Lee et al.