Papers
Learning neural audio features without supervision
Sarthak Yadav, Neil Zeghidour
Learning Noise-independent Speech Representation for High-quality Voice Conversion for Noisy Target Speakers
Liumeng Xue, Shan Yang, Na Hu et al.
Learning to rank with BERT-based confidence models in ASR rescoring
Ting-Wei Wu, I-Fan Chen, Ankur Gandhe
Learning Under Label Noise for Robust Spoken Language Understanding systems
Anoop Kumar, Pankaj Kumar Sharma, Aravind Illa et al.
Leveraging Acoustic Contextual Representation by Audio-textual Cross-modal Learning for Conversational ASR
Kun Wei, Yike Zhang, Sining Sun et al.
Leveraging Prosody for Punctuation Prediction of Spontaneous Speech
Yeonjin Cho, Sara Ng, Trang Tran et al.
Leveraging Pseudo-labeled Data to Improve Direct Speech-to-Speech Translation
Qianqian Dong, Fengpeng Yue, Tom Ko et al.
Leveraging Real Conversational Data for Multi-Channel Continuous Speech Separation
Xiaofei Wang, Dongmei Wang, Naoyuki Kanda et al.
Leveraging Simultaneous Translation for Enhancing Transcription of Low-resource Language via Cross Attention Mechanism
Kak Soky, Sheng Li, Masato Mimura et al.
Leveraging Symmetrical Convolutional Transformer Networks for Speech to Singing Voice Style Transfer
Shrutina Agarwal, Naoya Takahashi, Sriram Ganapathy
Leveraging unsupervised and weakly-supervised data to improve direct speech-to-speech translation
Ye Jia, Yifan Ding, Ankur Bapna et al.
Lexical stress in Spanish word segmentation
Alvaro Martin Iturralde Zurita, Meghan Clayards
LightHuBERT: Lightweight and Configurable Speech Representation Learning with Once-for-All Hidden-Unit BERT
Rui Wang, Qibing Bai, Junyi Ao et al.
Lightweight Full-band and Sub-band Fusion Network for Real Time Speech Enhancement
Zhuangqi Chen, Pingjian Zhang
Light-Weight Speaker Verification with Global Context Information
Miseul Kim, Zhenyu Piao, Seyun Um et al.
Linguistic-Acoustic Similarity Based Accent Shift for Accent Recognition
Qijie Shao, Jinghao Yan, Jian Kang et al.
Linguistically Informed Post-processing for ASR Error correction in Sanskrit
Rishabh Kumar, Devaraja Adiga, Rishav Ranjan et al.
Linguistic versus biological factors governing acoustic voice variation
Yoonjeong Lee, Jody Kreiman
Listen, Adapt, Better WER: Source-free Single-utterance Test-time Adaptation for Automatic Speech Recognition
Guan-Ting Lin, Shang-Wen Li, Hung-yi Lee
Listening with Googlears: Low-Latency Neural Multiframe Beamforming and Equalization for Hearing Aids
Samuel Yang, Scott Wisdom, Chet Gnegy et al.
Listen only to me! How well can target speech extraction handle false alarms?
Marc Delcroix, Keisuke Kinoshita, Tsubasa Ochiai et al.
Local Context-aware Self-attention for Continuous Sign Language Recognition
Ronglai Zuo, Brian Mak
Lombard Effect for Bilingual Speakers in Cantonese and English: importance of spectro-temporal features
Maximilian Karl Scharf, Sabine Hochmuth, Lena L.N. Wong et al.
Low-bit Shift Network for End-to-End Spoken Language Understanding
Anderson R. Avila, Khalil Bibi, Rui Heng Yang et al.
Low-complex and Highly-performed Binary Residual Neural Network for Small-footprint Keyword Spotting
Xiao Wang, Song Cheng, Jun Li et al.