Papers
MAS-GPT: Training LLMs to Build LLM-based Multi-Agent Systems
Rui Ye, Shuo Tang, Rui Ge et al.
Masked Autoencoders Are Effective Tokenizers for Diffusion Models
Hao Chen, Yujin Han, Fangyi Chen et al.
Masked Generative Nested Transformers with Decode Time Scaling
Sahil Goyal, Debapriya Tula, Gagan Jain et al.
Mask-Enhanced Autoregressive Prediction: Pay Less Attention to Learn More
Xialie Zhuang, Zhikai Jia, Jianjin Li et al.
MaskTwins: Dual-form Complementary Masking for Domain-Adaptive Image Segmentation
Jiawen Wang, Yinda Chen, Xiaoyu Liu et al.
Massive Values in Self-Attention Modules are the Key to Contextual Knowledge Understanding
Mingyu Jin, Kai Mei, Wujiang Xu et al.
MASS: Mathematical Data Selection via Skill Graphs for Pretraining Large Language Models
Jiazheng Li, Lu Yu, Qing Cui et al.
Mastering Board Games by External and Internal Planning with Language Models
John Schultz, Jakub Adamek, Matej Jusup et al.
Mastering Massive Multi-Task Reinforcement Learning via Mixture-of-Expert Decision Transformer
Yilun Kong, Guozheng Ma, Qi Zhao et al.
Mastering Multiple-Expert Routing: Realizable $H$-Consistency and Strong Guarantees for Learning to Defer
Anqi Mao, Mehryar Mohri, Yutao Zhong
MathConstruct: Challenging LLM Reasoning with Constructive Proofs
Mislav Balunovic, Jasper Dekoninck, Nikola Jovanović et al.
MATH-Perturb: Benchmarking LLMs’ Math Reasoning Abilities against Hard Perturbations
Kaixuan Huang, Jiacheng Guo, Zihao Li et al.
Matrix Completion with Incomplete Side Information via Orthogonal Complement Projection
Gengshuo Chang, Wei Zhang, Lehan Zhang
Matryoshka Quantization
Pranav Ajit Nair, Puranjay Datta, Jeff Dean et al.
MATS: An Audio Language Model under Text-only Supervision
Wen Wang, Ruibing Hou, Hong Chang et al.
Maximal Update Parametrization and Zero-Shot Hyperparameter Transfer for Fourier Neural Operators
Shanda Li, Shinjae Yoo, Yiming Yang
Maximizing Intermediate Checkpoint Value in LLM Pretraining with Bayesian Optimization
Deyuan Liu, Zecheng Wang, Bingning Wang et al.
Maximum Coverage in Turnstile Streams with Applications to Fingerprinting Measures
Alina Ene, Alessandro Epasto, Vahab Mirrokni et al.
Maximum Entropy Reinforcement Learning with Diffusion Policy
Xiaoyi Dong, Jian Cheng, Xi Sheryl Zhang
Maximum Total Correlation Reinforcement Learning
Bang You, Puze Liu, Huaping Liu et al.
MCU: An Evaluation Framework for Open-Ended Game Agents
Xinyue Zheng, Haowei Lin, Kaichen He et al.
MDDM: Practical Message-Driven Generative Image Steganography Based on Diffusion Models
Zihao Xu, Dawei Xu, Zihan Li et al.
Measuring Diversity: Axioms and Challenges
Mikhail Mironov, Liudmila Prokhorenkova
Measuring Diversity in Synthetic Datasets
Yuchang Zhu, Huizhe Zhang, Bingzhe Wu et al.
Measuring In-Context Computation Complexity via Hidden State Prediction
Vincent Herrmann, Róbert Csordás, Jürgen Schmidhuber