Papers
Low-bit Shift Network for End-to-End Spoken Language Understanding
Anderson R. Avila, Khalil Bibi, Rui Heng Yang et al.
Low-complex and Highly-performed Binary Residual Neural Network for Small-footprint Keyword Spotting
Xiao Wang, Song Cheng, Jun Li et al.
Low-data? No problem: low-resource, language-agnostic conversational text-to-speech via F0-conditioned data augmentation
Giulia Comini, Goeric Huybrechts, Manuel Sam Ribeiro et al.
Low-Latency Online Streaming VideoQA Using Audio-Visual Transformers
Chiori Hori, Takaaki Hori, Jonathan Le Roux
Low-Level Physiological Implications of End-to-End Learning for Speech Recognition
Louise Coppieters de Gibson, Philip N. Garner
Low-resource Accent Classification in Geographically-proximate Settings: A Forensic and Sociophonetics Perspective
Qingcheng Zeng, Dading Chong, Peilin Zhou et al.
Low Resource Comparison of Attention-based and Hybrid ASR Exploiting wav2vec 2.0
Aku Rouhe, Anja Virkkunen, Juho Leinonen et al.
Low-resource Low-footprint Wake-word Detection using Knowledge Distillation
Arindam Ghosh, Mark Fuhs, Deblin Bagchi et al.
M-Adapter: Modality Adaptation for End-to-End Speech-to-Text Translation
Jinming Zhao, Hao Yang, Gholamreza Haffari et al.
MAE-AST: Masked Autoencoding Audio Spectrogram Transformer
Alan Baade, Puyuan Peng, David Harwath
MAESTRO: Matched Speech Text Representations through Modality Matching
Zhehuai Chen, Yu Zhang, Andrew Rosenberg et al.
Mandarin Lombard Grid: a Lombard-grid-like corpus of Standard Chinese
Yuhong Yang, Xufeng Chen, Qingmu Liu et al.
Mandarin Tone Sandhi Realization: Evidence from Large Speech Corpora
Zuoyu Tian, Xiao Dong, Feier Gao et al.
MBI-Net: A Non-Intrusive Multi-Branched Speech Intelligibility Prediction Model for Hearing Aids
Ryandhimas Edo Zezario, Fei Chen, Chiou-Shann Fuh et al.
mcBERT: Momentum Contrastive Learning with BERT for Zero-Shot Slot Filling
Seong-Hwan Heo, WonKee Lee, Jong-Hyeok Lee
Membership Inference Attacks Against Self-supervised Speech Models
Wei-Cheng Tseng, Wei-Tsung Kao, Hung-yi Lee
Memory-Efficient Multi-Step Speech Enhancement with Neural ODE
Jen-Hung Huang, Chung-Hsien Wu
Memory-Efficient Training of RNN-Transducer with Sampled Softmax
Jaesong Lee, Lukas Lee, Shinji Watanabe
Meta Auxiliary Learning for Low-resource Spoken Language Understanding
Yingying Gao, Junlan Feng, Chao Deng et al.
Method for improving the word intelligibility of presented speech using bone-conduction headphones
Teruki Toya, Wenyu Zhu, Maori Kobayashi et al.
MFA-Conformer: Multi-scale Feature Aggregation Conformer for Automatic Speaker Verification
Yang Zhang, Zhiqiang Lv, Haibin Wu et al.
Microphone Array Channel Combination Algorithms for Overlapped Speech Detection
Theo Mariotte, Anthony Larcher, Silvio Montrésor et al.
MIM-DG: Mutual information minimization-based domain generalization for speaker verification
Woohyun Kang, Md Jahangir Alam, Abderrahim Fathan
MIMO-DoAnet: Multi-channel Input and Multiple Outputs DoA Network with Unknown Number of Sound Sources
Haoran Yin, Meng Ge, Yanjie Fu et al.