Papers
Mixed-curvature decision trees and random forests
Philippe Chlenski, Quentin Chu, Raiyan R. Khan et al.
MixMin: Finding Data Mixtures via Convex Minimization
Anvith Thudi, Evianne Rovers, Yangjun Ruan et al.
Mixture of Experts Made Intrinsically Interpretable
Xingyi Yang, Constantin Venhoff, Ashkan Khakzar et al.
Mixture of Experts Provably Detect and Learn the Latent Cluster Structure in Gradient-Based Learning
Ryotaro Kawata, Kohsei Matsutani, Yuri Kinoshita et al.
Mixture of Hidden-Dimensions: Not All Hidden-States’ Dimensions are Needed in Transformer
Yilong Chen, Junyuan Shang, Zhenyu Zhang et al.
Mixture of Lookup Experts
Shibo Jie, Yehui Tang, Kai Han et al.
ML$^2$-GCL: Manifold Learning Inspired Lightweight Graph Contrastive Learning
Jianqing Liang, Zhiqiang Li, Xinkai Wei et al.
MME-CoT: Benchmarking Chain-of-Thought in Large Multimodal Models for Reasoning Quality, Robustness, and Efficiency
Dongzhi Jiang, Renrui Zhang, Ziyu Guo et al.
MMedPO: Aligning Medical Vision-Language Models with Clinical-Aware Multimodal Preference Optimization
Kangyu Zhu, Peng Xia, Yun Li et al.
MMInference: Accelerating Pre-filling for Long-Context Visual Language Models via Modality-Aware Permutation Sparse Attention
Yucheng Li, Huiqiang Jiang, Chengruidong Zhang et al.
MM-RLHF: The Next Step Forward in Multimodal LLM Alignment
Yifan Zhang, Tao Yu, Haochen Tian et al.
Modalities Contribute Unequally: Enhancing Medical Multi-modal Learning through Adaptive Modality Token Re-balancing
Jie Peng, Jenna L. Ballard, Mohan Zhang et al.
MODA: MOdular Duplex Attention for Multimodal Perception, Cognition, and Emotion Understanding
Zhicheng Zhang, Wuyou Xia, Chenxi Zhao et al.
Model-Based Exploration in Monitored Markov Decision Processes
Alireza Kazemipour, Matthew E. Taylor, Michael Bowling
Model Immunization from a Condition Number Perspective
Amber Yijia Zheng, Site Bai, Brian Bullins et al.
Modeling All-Atom Glycan Structures via Hierarchical Message Passing and Multi-Scale Pre-training
Minghao Xu, Jiaze Song, Keming Wu et al.
Modeling Multi-Task Model Merging as Adaptive Projective Gradient Descent
Yongxian Wei, Anke Tang, Li Shen et al.
Models of Heavy-Tailed Mechanistic Universality
Liam Hodgkinson, Zhichao Wang, Michael W. Mahoney
Model Steering: Learning with a Reference Model Improves Generalization Bounds and Scaling Laws
Xiyuan Wei, Ming Lin, Fanjiang Ye et al.
Model Swarms: Collaborative Search to Adapt LLM Experts via Swarm Intelligence
Shangbin Feng, Zifeng Wang, Yike Wang et al.
Model Uncertainty Quantification by Conformal Prediction in Continual Learning
Rui Gao, Weiwei Liu
Modified K-means Algorithm with Local Optimality Guarantees
Mingyi Li, Michael R. Metel, Akiko Takeda
Modular Duality in Deep Learning
Jeremy Bernstein, Laker Newhouse
Modularized Self-Reflected Video Reasoner for Multimodal LLM with Application to Video Question Answering
Zihan Song, Xin Wang, Zi Qian et al.
Modulated Diffusion: Accelerating Generative Modeling with Modulated Quantization
Weizhi Gao, Zhichao Hou, Junqi Yin et al.