Papers
Malafide: a novel adversarial convolutive noise attack against deepfake and spoofing detection systems
Michele Panariello, Wanying Ge, Hemlata Tak et al.
Mandarin Electrolaryngeal Speech Voice Conversion using Cross-domain Features
Hsin-Hao Chen, Yung-Lun Chien, Ming-Chi Yen et al.
Masked Audio Modeling with CLAP and Multi-Objective Learning
Yifei Xin, Xiulian Peng, Yan Lu
Masked Modeling Duo for Speech: Specializing General-Purpose Audio Representation to Speech using Denoising Distillation
Daisuke Niizumi, Daiki Takeuchi, Yasunori Ohishi et al.
MaskedSpeech: Context-aware Speech Synthesis with Masking Strategy
Ya-Jie Zhang, Wei Song, Yanghao Yue et al.
Masking Kernel for Learning Energy-Efficient Representations for Speaker Recognition and Mobile Health
Apiwat Ditthapron, Emmanuel O. Agu, Adam C. Lammert
Matching Acoustic and Perceptual Measures of Phonation Assessment in Disordered Speech - A Case Study
Melanie Jouaiti, Pippa Kirby, Ravi Vaidyanathan
Matching Latent Encoding for Audio-Text based Keyword Spotting
Kumari Nishu, Minsik Cho, Devang Naik
MAVD: The First Open Large-Scale Mandarin Audio-Visual Dataset with Depth Information
Jianrong Wang, Yuchen Huo, Li Liu et al.
MCR-Data2vec 2.0: Improving Self-supervised Speech Pre-training via Model-level Consistency Regularization
Ji Won Yoon, Seok Min Kim, Nam Soo Kim
MC-SpEx: Towards Effective Speaker Extraction with Multi-Scale Interfusion and Conditional Speaker Modulation
Jun Chen, Wei Rao, Zilin Wang et al.
MD3: The Multi-Dialect Dataset of Dialogues
Jacob Eisenstein, Vinodkumar Prabhakaran, Clara Rivera et al.
mdctGAN: Taming transformer-based GAN for speech super-resolution with Modified DCT spectra
Chenhao Shuai, Chaohua Shi, Lu Gan et al.
Measuring Language Development From Child-centered Recordings
Yaya Sy, William N. Havard, Marvin Lavechin et al.
Measuring Phonological Precision in Children with Cleft Lip and Palate
Tomás Arias-Vergara, Elizabeth Londoño-Mora, Paula A. Pérez-Toro et al.
Measuring prosody in child speech using SoapBox Fluency API
Mauro Nicolao, Brenda McGuirk, Declan Moore et al.
MEG Encoding using Word Context Semantics in Listening Stories
Subba Reddy Oota, Nathan Trouvain, Frederic Alexandre et al.
Memory-augmented conformer for improved end-to-end long-form ASR
Carlos Carvalho, Alberto Abad
Memory Augmented Lookup Dictionary Based Language Modeling for Automatic Speech Recognition
Yukun Feng, Ming Tu, Rui Xia et al.
Memory Network-Based End-To-End Neural ES-KMeans for Improved Word Segmentation
Yu Iwamoto, Takahiro Shinozaki
MERLIon CCS Challenge: A English-Mandarin code-switching child-directed speech corpus for language identification and diarization
Victoria Y. H. Chua, Hexin Liu, Leibny Paola Garcia et al.
Meta-domain Adversarial Contrastive Learning for Alleviating Individual Bias in Self-sentiment Predictions
Zhi Li, Ryu Takeda, Takahiro Hara
MF-PAM: Accurate Pitch Estimation through Periodicity Analysis and Multi-level Feature Fusion
Woo-Jin Chung, Doyeon Kim, Soo-Whan Chung et al.