Papers
MODULI: Unlocking Preference Generalization via Diffusion Models for Offline Multi-Objective Reinforcement Learning
Yifu Yuan, Zhenrui Zheng, Zibin Dong et al.
MoEQuant: Enhancing Quantization for Mixture-of-Experts Large Language Models via Expert-Balanced Sampling and Affinity Guidance
Zhixuan Chen, Xing Hu, Dawei Yang et al.
MoE-SVD: Structured Mixture-of-Experts LLMs Compression via Singular Value Decomposition
Wei Li, Lujun Li, Hao Gu et al.
MOGIC: Metadata-infused Oracle Guidance for Improved Extreme Classification
Suchith Chidananda Prabhu, Bhavyajeet Singh, Anshul Mittal et al.
MoHAVE: Mixture of Hierarchical Audio-Visual Experts for Robust Speech Recognition
Sungnyun Kim, Kangwook Jang, Sangmin Bae et al.
MoH: Multi-Head Attention as Mixture-of-Head Attention
Peng Jin, Bo Zhu, Li Yuan et al.
Moirai-MoE: Empowering Time Series Foundation Models with Sparse Mixture of Experts
Xu Liu, Juncheng Liu, Gerald Woo et al.
MoMa: Modulating Mamba for Adapting Image Foundation Models to Video Recognition
Yuhuan Yang, Chaofan Ma, Zhenjie Mao et al.
Momentum-Driven Adaptivity: Towards Tuning-Free Asynchronous Federated Learning
Wenjing Yan, Xiangyu Zhong, Xiaolu Wang et al.
MONA: Myopic Optimization with Non-myopic Approval Can Mitigate Multi-step Reward Hacking
Sebastian Farquhar, Vikrant Varma, David Lindner et al.
Monte Carlo Tree Diffusion for System 2 Planning
Jaesik Yoon, Hyeonseo Cho, Doojin Baek et al.
Monte Carlo Tree Search for Comprehensive Exploration in LLM-Based Automatic Heuristic Design
Zhi Zheng, Zhuoliang Xie, Zhenkun Wang et al.
Monte-Carlo Tree Search with Uncertainty Propagation via Optimal Transport
Tuan Quang Dam, Pascal Stenger, Lukas Schneider et al.
MoRAgent: Parameter Efficient Agent Tuning with Mixture-of-Roles
Jing Han, Binwei Yan, Tianyu Guo et al.
More Than Meets the Eye: Enhancing Multi-Object Tracking Even with Prolonged Occlusions
Bishoy Galoaa, Somaieh Amraee, Sarah Ostadabbas
Morse: Dual-Sampling for Lossless Acceleration of Diffusion Models
Chao Li, Jiawei Fan, Anbang Yao
MP-Nav: Enhancing Data Poisoning Attacks against Multimodal Learning
Jingfeng Zhang, Prashanth Krishnamurthy, Naman Patel et al.
MPO: An Efficient Post-Processing Framework for Mixing Diverse Preference Alignment
Tianze Wang, Dongnan Gui, Yifan Hu et al.
MTL-UE: Learning to Learn Nothing for Multi-Task Learning
Yi Yu, Song Xia, Siyuan Yang et al.
MTSTRec: Multimodal Time-Aligned Shared Token Recommender
Ming-Yi Hong, Yen-Jung Hsu, Miao-Chen Chiang et al.
MUDDFormer: Breaking Residual Bottlenecks in Transformers via Multiway Dynamic Dense Connections
Da Xiao, Qingye Meng, Shengping Li et al.
MuLan: Adapting Multilingual Diffusion Models for Hundreds of Languages with Negligible Cost
Sen Xing, Muyan Zhong, Zeqiang Lai et al.
Multiaccuracy and Multicalibration via Proxy Groups
Beepul Bharti, Mary Versa Clemens-Sewall, Paul Yi et al.
Multi-agent Architecture Search via Agentic Supernet
Guibin Zhang, Luyang Niu, Junfeng Fang et al.
Multi-Armed Bandits with Interference: Bridging Causal Inference and Adversarial Bandits
Su Jia, Peter I. Frazier, Nathan Kallus