Papers
Mastering Robot Manipulation with Multimodal Prompts through Pretraining and Multi-task Fine-tuning
Jiachen Li, Qiaozi Gao, Michael Johnston et al.
Mastering Text-to-Image Diffusion: Recaptioning, Planning, and Generating with Multimodal LLMs
Ling Yang, Zhaochen Yu, Chenlin Meng et al.
Mastering Zero-Shot Interactions in Cooperative and Competitive Simultaneous Games
Yannik Mahlau, Frederik Schubert, Bodo Rosenhahn
MathScale: Scaling Instruction Tuning for Mathematical Reasoning
Zhengyang Tang, Xingxing Zhang, Benyou Wang et al.
Matrix Information Theory for Self-Supervised Learning
Yifan Zhang, Zhiquan Tan, Jingqin Yang et al.
Matroid Semi-Bandits in Sublinear Time
Ruo-Chun Tzeng, Naoto Ohsaka, Kaito Ariu
MaxMin-RLHF: Alignment with Diverse Human Preferences
Souradip Chakraborty, Jiahao Qiu, Hui Yuan et al.
MC-GTA: Metric-Constrained Model-Based Clustering using Goodness-of-fit Tests with Autocorrelations
Zhangyu Wang, Gengchen Mai, Krzysztof Janowicz et al.
MD tree: a model-diagnostic tree grown on loss landscape
Yefan Zhou, Jianlong Chen, Qinxue Cao et al.
Mean Estimation in the Add-Remove Model of Differential Privacy
Alex Kulesza, Ananda Theertha Suresh, Yuyan Wang
Mean-field Analysis on Two-layer Neural Networks from a Kernel Perspective
Shokichi Takakura, Taiji Suzuki
Mean-field Chaos Diffusion Models
Sungwoo Park, Dongjun Kim, Ahmed Alaa
Mean Field Langevin Actor-Critic: Faster Convergence and Global Optimality beyond Lazy Learning
Kakei Yamamoto, Kazusato Oko, Zhuoran Yang et al.
Mean-field Underdamped Langevin Dynamics and its Spacetime Discretization
Qiang Fu, Ashia Camage Wilson
Measures of diversity and space-filling designs for categorical data
Cedric Malherbe, Emilio Domínguez-Sánchez, Merwan Barlier et al.
Measuring Stochastic Data Complexity with Boltzmann Influence Functions
Nathan Hoyen Ng, Roger Baker Grosse, Marzyeh Ghassemi
Mechanistic Design and Scaling of Hybrid Architectures
Michael Poli, Armin W Thomas, Eric Nguyen et al.
Mechanistic Neural Networks for Scientific Machine Learning
Adeel Pervez, Francesco Locatello, Stratis Gavves
Medusa: Simple LLM Inference Acceleration Framework with Multiple Decoding Heads
Tianle Cai, Yuhong Li, Zhengyang Geng et al.
Membership Inference Attacks on Diffusion Models via Quantile Regression
Shuai Tang, Steven Wu, Sergul Aydore et al.
Memoria: Resolving Fateful Forgetting Problem through Human-Inspired Memory Architecture
Sangjun Park, Jinyeong Bak
Memorization Through the Lens of Curvature of Loss Function Around Samples
Isha Garg, Deepak Ravikumar, Kaushik Roy
Memory Consolidation Enables Long-Context Video Understanding
Ivana Balazevic, Yuge Shi, Pinelopi Papalampidi et al.
Memory Efficient Neural Processes via Constant Memory Attention Block
Leo Feng, Frederick Tung, Hossein Hajimirsadeghi et al.
MEMORYLLM: Towards Self-Updatable Large Language Models
Yu Wang, Yifan Gao, Xiusi Chen et al.