Papers
Modeling Dialectal Variation for Swiss German Automatic Speech Recognition
Abbas Khosravani, Philip N. Garner, Alexandros Lazaridis
Modeling Dysphonia Severity as a Function of Roughness and Breathiness Ratings in the GRBAS Scale
Carlos A. Ferrer, Efren Aragón, María E. Hdez-Díaz et al.
Modeling Sensorimotor Adaptation in Speech Through Alterations to Forward and Inverse Models
Taijing Chen, Adam Lammert, Benjamin Parrell
Modeling the Effect of Military Oxygen Masks on Speech Characteristics
Benjamin Elie, Jodie Gauvain, Jean-Luc Gauvain et al.
Models of Reaction Times in Auditory Lexical Decision: RTonset versus RToffset
Sophie Brand, Kimberley Mulder, Louis ten Bosch et al.
Modular Multi-Modal Attention Network for Alzheimer’s Disease Detection Using Patient Audio and Language Data
Ning Wang, Yupeng Cao, Shuai Hao et al.
Momentum Pseudo-Labeling for Semi-Supervised Speech Recognition
Yosuke Higuchi, Niko Moritz, Jonathan Le Roux et al.
MoM: Minutes of Meeting Bot
Benjamin Milde, Tim Fischer, Steffen Remus et al.
MUCS 2021: Multilingual and Code-Switching ASR Challenges for Low Resource Indian Languages
Anuj Diwan, Rakesh Vaideeswaran, Sanket Shah et al.
Multi-Attentive Detection of the Spider Monkey Whinny in the (Actual) Wild
Georgios Rizos, Jenna Lawson, Zhuoda Han et al.
Multi-Channel Opus Compression for Far-Field Automatic Speech Recognition with a Fixed Bitrate Budget
Lukas Drude, Jahn Heymann, Andreas Schwarz et al.
Multi-Channel Speaker Verification for Single and Multi-Talker Speech
Saurabh Kataria, Shi-Xiong Zhang, Dong Yu
Multi-Channel Transformer Transducer for Speech Recognition
Feng-Ju Chang, Martin Radfar, Athanasios Mouchtaris et al.
Multi-Channel VAD for Transcription of Group Discussion
Osamu Ichikawa, Kaito Nakano, Takahiro Nakayama et al.
Multi-Domain Knowledge Distillation via Uncertainty-Matching for End-to-End ASR Models
Ho-Gyeong Kim, Min-Joong Lee, Hoshik Lee et al.
Multi-Encoder Learning and Stream Fusion for Transformer-Based End-to-End Automatic Speech Recognition
Timo Lohrenz, Zhengyang Li, Tim Fingscheidt
Multi-Level Transfer Learning from Near-Field to Far-Field Speaker Verification
Li Zhang, Qing Wang, Kong Aik Lee et al.
Multilingual Speech Evaluation: Case Studies on English, Malay and Tamil
Huayun Zhang, Ke Shi, Nancy F. Chen
Multilingual Transfer of Acoustic Word Embeddings Improves When Training on Languages Related to the Target Zero-Resource Language
Christiaan Jacobs, Herman Kamper
Multimodal Sentiment Analysis with Temporal Modality Attention
Fan Qian, Jiqing Han
Multimodal Speech Summarization Through Semantic Concept Learning
Shruti Palaskar, Ruslan Salakhutdinov, Alan W. Black et al.
Multi-Mode Transformer Transducer with Stochastic Future Context
Kwangyoun Kim, Felix Wu, Prashant Sridhar et al.
Multiple Softmax Architecture for Streaming Multilingual End-to-End ASR Systems
Vikas Joshi, Amit Das, Eric Sun et al.
Multiple Sound Source Localization Based on Interchannel Phase Differences in All Frequencies with Spectral Masks
Hyungchan Song, Jong Won Shin
Multi-Speaker ASR Combining Non-Autoregressive Conformer CTC and Conditional Speaker Chain
Pengcheng Guo, Xuankai Chang, Shinji Watanabe et al.