Papers
Mixed Diffusion for 3D Indoor Scene Synthesis
Siyi Hu, Diego Martín Arroyo, Stephanie Debats et al.
MixER: From Cross-Modal to Mixed-Modal Visible-Infrared Re-Identification
Mahdi Alehdaghi, Rajarshi Bhattacharya, Dai Yannick et al.
MixKVQ: Query-Aware Mixed-Precision KV Cache Quantization for Long-Context Reasoning
Tao Zhang, Ziqian Zeng, Hao Peng et al.
Mix-QSAM2: Mixed-Precision Quantization for High Fidelity Segmentation in Resource Constrained Scenarios
Yuzhe Duan, Xuanxuan Ren, Guizhe Dong et al.
MixtureKit: A General Framework for Composing, Training, and Visualizing Mixture-of-Experts Models
Ahmad Chamma, Omar El Herraoui, Guokan Shang
Mixture-of-Experts with Intermediate CTC Supervision for Accented Speech Recognition
Wonjun Lee, Hyounghun Kim, Gary Lee
Mixture of Heterogeneous Grouped Experts for Language Modeling
Zhicheng Ma, Xiang Liu, Zhaoxiang Liu et al.
Mixture-of-Minds: Multi-Agent Reinforcement Learning for Table Understanding
Yuhang Zhou, Mingrui Zhang, Ke Li et al.
Mixture of Ranks with Degradation-Aware Routing for One-Step Real-World Image Super-Resolution
Xiao He, Zhijun Tu, Kun Cheng et al.
Mixture-of-Trees: Learning to Select and Weigh Reasoning Paths for Efficient LLM Inference
Yangbo Wei, Zhen Huang, Shaoqiang Lu et al.
MizanQA: A Benchmark for Multi-Answer Moroccan Legal QA
Adil Bahaj, Mounir Ghogho
MLLM Enriched Explainable Multiple Clustering
Shan Zhang, Liangrui Ren, Qiaoyu Tan et al.
mllm-shap: A Shapley Value Explainability Platform for Text-Audio Multimodal Large Language Models
Jakub Muszyński, Paweł Pozorski, Maria Ganzha
M-Loss: Quantifying Model Merging Compatibility with Limited Unlabeled Data
Tiantong Wang, Yiyang Duan, Haoyu Chen et al.
MM4Rec: Multi-Source and Multi-Scenario Recommender for Unified User Preference
Chu-Chun Yu, Ming-Yi Hong, Miao-Chen Chiang et al.
MMAC: A Multilingual, Multimodal Alignment Framework for Cultural Grounding Evaluation
Weihua Zheng, Zhengyuan Liu, Tanmoy Chakraborty et al.
MMAU-Pro: A Challenging and Comprehensive Benchmark for Holistic Evaluation of Audio General Intelligence
Sonal Kumar, Šimon Sedláček, Vaibhavi Lokegaonkar et al.
MMBERT: Scaled Mixture-of-Experts Multimodal BERT for Robust Chinese Hate Speech Detection Under Cloaking Perturbations
Qiyao Xue, Yuchen Dou, Zheyuan Ryan Shi et al.
MM-BizRAG: Rethinking Multimodal Retrieval-Augmented Generation for General Purpose Enterprise Q&A
Hanoz Bhathena, Parin Rajesh Jhaveri, Rohan Mittal et al.
MMCLIP: Cross-Modal Attention Masked Modelling for Medical Language-Image Pre-Training
Biao Wu, Yutong Xie, Zeyu Zhang et al.
MMCM: Multimodality-aware Metric using Clustering-based Modes for Probabilistic Human Motion Prediction
Kyotaro Tokoro, Hiromu Taketsugu, Norimichi Ukita
MMErroR: A Benchmark for Erroneous Reasoning in Vision-Language Models
Yang Shi, Yifeng Xie, Minzhe Guo et al.
MME-SCI: A Comprehensive and Challenging Science Benchmark for Multimodal Large Language Models
Jiacheng Ruan, Dan Jiang, Xian Gao et al.
MMG-Vid: Maximizing Marginal Gains at Segment-level and Token-level for Efficient Video LLMs
Junpeng Ma, Qizhe Zhang, Ming Lu et al.
MMG-VL: A Vision-Language Driven Approach for Multi-Person Motion Generation
Songyuan Yang, Wanrong Huang, Yinuo Liu et al.