Papers
MLLM Enriched Explainable Multiple Clustering
Shan Zhang, Liangrui Ren, Qiaoyu Tan et al.
M-Loss: Quantifying Model Merging Compatibility with Limited Unlabeled Data
Tiantong Wang, Yiyang Duan, Haoyu Chen et al.
MM4Rec: Multi-Source and Multi-Scenario Recommender for Unified User Preference
Chu-Chun Yu, Ming-Yi Hong, Miao-Chen Chiang et al.
MMAU-Pro: A Challenging and Comprehensive Benchmark for Holistic Evaluation of Audio General Intelligence
Sonal Kumar, Šimon Sedláček, Vaibhavi Lokegaonkar et al.
MMBERT: Scaled Mixture-of-Experts Multimodal BERT for Robust Chinese Hate Speech Detection Under Cloaking Perturbations
Qiyao Xue, Yuchen Dou, Zheyuan Ryan Shi et al.
MME-SCI: A Comprehensive and Challenging Science Benchmark for Multimodal Large Language Models
Jiacheng Ruan, Dan Jiang, Xian Gao et al.
MMG-Vid: Maximizing Marginal Gains at Segment-level and Token-level for Efficient Video LLMs
Junpeng Ma, Qizhe Zhang, Ming Lu et al.
MMG-VL: A Vision-Language Driven Approach for Multi-Person Motion Generation
Songyuan Yang, Wanrong Huang, Yinuo Liu et al.
MMhops-R1: Multimodal Multi-hop Reasoning
Tao Zhang, Ziqi Zhang, Zongyang Ma et al.
MMIFEvol: Towards Evolutionary Multimodal Instruction Following
Haoyu Wang, Sihang Jiang, Xiangru Zhu et al.
mmJEPA-ECG: Cross-Posture Robust Contactless Electrocardiogram Monitoring via Millimeter Wave Radar Sensing
Ziyang Liu, Siyuan He, Feng Liang et al.
MMMamba: A Versatile Cross-Modal in Context Fusion Framework for Pan-Sharpening and Zero-Shot Image Enhancement
Yingying Wang, Xuanhua He, Chen Wu et al.
MMPG: MoE-based Adaptive Multi-Perspective Graph Fusion for Protein Representation Learning
Yusong Wang, Jialun Shen, Zhihao Wu et al.
mmPred: Radar-based Human Motion Prediction in the Dark
Junqiao Fan, Haocong Rao, Jiarui Zhang et al.
MM-R1: Unleashing the Power of Unified Multimodal Large Language Models for Personalized Image Generation
Qian Liang, Yujia Wu, Kuncheng Li et al.
MMRAG-RFT: Two-stage Reinforcement Fine-tuning for Explainable Multi-modal Retrieval-augmented Generation
Shengwei Zhao, Jingwen Yao, Sitong Wei et al.
Mnemosyne: Accelerating Multi-Hop Question Answering via Cache Hit Order Fitting
Haizhou Du, Jiujiu Li, Dongyang Li et al.
MOBA: A Material-Oriented Backdoor Attack Against LiDAR-Based 3D Object Detection Systems
Saket Sanjeev Chaturvedi, Gaurav Bagwe, Lan Emily Zhang et al.
MoBGS: Motion Deblurring Dynamic 3D Gaussian Splatting for Blurry Monocular Video
Minh-Quan Viet Bui, Jongmin Park, Juan Luis Gonzalez et al.
Mobile-Agent-RAG: Driving Smart Multi-Agent Coordination with Contextual Knowledge Empowerment for Long-Horizon Mobile Automation
Yuxiang Zhou, Jichang Li, Yanhao Zhang et al.
MobileSafetyBench: Evaluating Safety of Autonomous Agents in Mobile Device Control
Juyong Lee, Dongyoon Hahm, June Suk Choi et al.
MoCast: Learning Turbulent Motions Under Physical Guidance for Precipitation Nowcasting
Binqing Wu, Weiqi Chen, Shiyu Liu et al.
MoCHA: Advanced Vision-Language Reasoning with MoE Connector and Hierarchical Group Attention
Yuqi Pang, Bowen Yang, Yun Cao et al.
Modality and Task Adaptation for Enhanced Zero-shot Composed Image Retrieval
Haiwen Li, Delong Liu, Zhaohui Hou et al.
Modality-Aware Bias Mitigation and Invariance Learning for Unsupervised Visible-Infrared Person Re-Identification
Menglin Wang, Xiaojin Gong, Jiachen Li et al.