Papers
Knowledge Distillation for Streaming Transformer–Transducer
Atsushi Kojima
Knowledge Distillation from BERT Transformer to Speech Transformer for Intent Classification
Yidi Jiang, Bidisha Sharma, Maulik Madhavi et al.
Knowledge Distillation from Multi-Modality to Single-Modality for Person Verification
Leying Zhang, Zhengyang Chen, Yanmin Qian
Know Your Enemy, Know Yourself: A Unified Two-Stage Framework for Speech Enhancement
Wenzhe Liu, Andong Li, Yuxuan Ke et al.
kosp2e: Korean Speech to English Translation Corpus
Won Ik Cho, Seok Min Kim, Hyunchang Cho et al.
Label Embedding for Chinese Grapheme-to-Phoneme Conversion
Eunbi Choi, Hwa-Yeon Kim, Jong-Hwan Kim et al.
LACOPE: Latency-Constrained Pitch Estimation for Speech Enhancement
Hendrik Schröter, Tobias Rosenkranz, Alberto N. Escalante-B et al.
Lalilo: A Reading Assistant for Children Featuring Speech Recognition-Based Reading Mistake Detection
Corentin Hembise, Lucile Gelin, Morgane Daniel
Language and Speaker-Independent Feature Transformation for End-to-End Multilingual Speech Recognition
Tomoaki Hayakawa, Chee Siang Leow, Akio Kobayashi et al.
Language Modeling and Artificial Intelligence
Tomáš Mikolov
Language or Paralanguage, This is the Problem: Comparing Depressed and Non-Depressed Speakers Through the Analysis of Gated Multimodal Units
Nujud Aloshban, Anna Esposito, Alessandro Vinciarelli
Language Recognition Based on Unsupervised Pretrained Models
Haibin Yu, Jing Zhao, Song Yang et al.
Language Recognition on Unknown Conditions: The LORIA-Inria-MULTISPEECH System for AP20-OLR Challenge
Raphaël Duroselle, Md. Sahidullah, Denis Jouvet et al.
Large-Scale Pre-Training of End-to-End Multi-Talker ASR for Meeting Transcription with Single Distant Microphone
Naoyuki Kanda, Guoli Ye, Yu Wu et al.
Large-Scale Self- and Semi-Supervised Learning for Speech Translation
Changhan Wang, Anne Wu, Juan Pino et al.
Late Fusion of the Available Lexicon and Raw Waveform-Based Acoustic Modeling for Depression and Dementia Recognition
Esaú Villatoro-Tello, S. Pavankumar Dubagunta, Julian Fritsch et al.
Layer Pruning on Demand with Intermediate CTC
Jaesong Lee, Jingu Kang, Shinji Watanabe
Layer-Wise Fast Adaptation for End-to-End Multi-Accent Speech Recognition
Xun Gong, Yizhou Lu, Zhikai Zhou et al.
LEAP Submission for the Third DIHARD Diarization Challenge
Prachi Singh, Rajat Varma, Venkat Krishnamohan et al.
Learning a Neural Diff for Speech Models
Jonathan Macoskey, Grant P. Strimel, Ariya Rastrow
Learning Explicit Prosody Models and Deep Speaker Embeddings for Atypical Voice Conversion
Disong Wang, Songxiang Liu, Lifa Sun et al.
Learning Fine-Grained Cross Modality Excitement for Speech Emotion Recognition
Hang Li, Wenbiao Ding, Zhongqin Wu et al.
Learning Mutual Correlation in Multimodal Transformer for Speech Emotion Recognition
Yuhua Wang, Guang Shen, Yuezhu Xu et al.
Learning Robust Speech Representation with an Articulatory-Regularized Variational Autoencoder
Marc-Antoine Georges, Laurent Girin, Jean-Luc Schwartz et al.