Papers
Meta-Learning for Speech Emotion Recognition Considering Ambiguity of Emotion Labels
Takuya Fujioka, Takeshi Homma, Kenji Nagamatsu
Meta Multi-Task Learning for Speech Emotion Recognition
Ruichu Cai, Kaibin Guo, Boyan Xu et al.
Metric Learning Loss Functions to Reduce Domain Mismatch in the x-Vector Space for Language Recognition
Raphaël Duroselle, Denis Jouvet, Irina Illina
Microphone Array Post-Filter for Target Speech Enhancement Without a Prior Information of Point Interferers
Guanjun Li, Shan Liang, Shuai Nie et al.
Microprosodic Variability in Plosives in German and Austrian German
Margaret Zellers, Barbara Schuppler
Minimum Bayes Risk Training of RNN-Transducer for End-to-End Speech Recognition
Chao Weng, Chengzhu Yu, Jia Cui et al.
MIRNet: Learning Multiple Identities Representations in Overlapped Speech
Hyewon Han, Soo-Whan Chung, Hong-Goo Kang
Mixed Case Contextual ASR Using Capitalization Masks
Diamantino Caseiro, Pat Rondon, Quoc-Nam Le The et al.
Mixtures of Deep Neural Experts for Automated Speech Scoring
Sara Papi, Edmondo Trentin, Roberto Gretter et al.
MLNET: An Adaptive Multiple Receptive-Field Attention Neural Network for Voice Activity Detection
Zhenpeng Zheng, Jianzong Wang, Ning Cheng et al.
MLS: A Large-Scale Multilingual Dataset for Speech Research
Vineel Pratap, Qiantong Xu, Anuroop Sriram et al.
Mobile-Assisted Prosody Training for Limited English Proficiency: Learner Background and Speech Learning Pattern
Kevin Hirschi, Okim Kang, Catia Cucchiarini et al.
MoBoAligner: A Neural Alignment Model for Non-Autoregressive TTS with Monotonic Boundary Search
Naihan Li, Shujie Liu, Yanqing Liu et al.
Modeling ASR Ambiguity for Neural Dialogue State Tracking
Vaishali Pal, Fabien Guillot, Manish Shrivastava et al.
Modeling Global Body Configurations in American Sign Language
Nicholas Wilkins, Max Cordes Galbraith, Ifeoma Nwogu
Monolingual Data Selection Analysis for English-Mandarin Hybrid Code-Switching Speech Recognition
Haobo Zhang, Haihua Xu, Van Tung Pham et al.
Multi-Encoder-Decoder Transformer for Code-Switching Speech Recognition
Xinyuan Zhou, Emre Yılmaz, Yanhua Long et al.
Multilingual Acoustic and Language Modeling for Ethio-Semitic Languages
Solomon Teferra Abate, Martha Yifiru Tachbelie, Tanja Schultz
Multilingual Jointly Trained Acoustic and Written Word Embeddings
Yushi Hu, Shane Settle, Karen Livescu
Multilingual Speech Recognition Using Language-Specific Phoneme Recognition as Auxiliary Task for Indian Languages
Hardik B. Sailor, Thomas Hain
Multilingual Speech Recognition with Self-Attention Structured Parameterization
Yun Zhu, Parisa Haghani, Anshuman Tripathi et al.
Multimodal Association for Speaker Verification
Suwon Shon, James Glass
Multi-Modal Attention for Speech Emotion Recognition
Zexu Pan, Zhaojie Luo, Jichen Yang et al.
Multimodal Deception Detection Using Automatically Extracted Acoustic, Visual, and Lexical Features
Jiaxuan Zhang, Sarah Ita Levitan, Julia Hirschberg