Papers
mmPred: Radar-based Human Motion Prediction in the Dark
Junqiao Fan, Haocong Rao, Jiarui Zhang et al.
MM-R1: Unleashing the Power of Unified Multimodal Large Language Models for Personalized Image Generation
Qian Liang, Yujia Wu, Kuncheng Li et al.
MMRA: A Benchmark for Evaluating Multi-Granularity and Multi-Image Relational Association Capabilities in Large Visual Language Models
Siwei Wu, King Zhu, Yu Bai et al.
MMRAG-RFT: Two-stage Reinforcement Fine-tuning for Explainable Multi-modal Retrieval-augmented Generation
Shengwei Zhao, Jingwen Yao, Sitong Wei et al.
MMSciCode: Real-world Evaluation of Multilingual Multi-Discipline Scientific Research Coding
Xue Xia, Zheyuan Yang, Arman Cohan et al.
MMSearch-R1: Incentivizing LMMs to Search
Jinming Wu, Zihao Deng, Wei Li et al.
MM-StanceDet: Retrieval-Augmented Multi-modal Multi-agent Stance Detection
Weihai Lu, Zhejun Zhao, Yanshu Li et al.
MM-TS: Multi-Modal Temperature and Margin Schedules for Contrastive Learning with Long-Tail Data
Siarhei Sheludzko, Dhimitrios Duka, Bernt Schiele et al.
MMTutorBench: The First Multimodal Benchmark for AI Math Tutoring
Tengchao Yang, Sichen Guo, Mengzhao Jia et al.
MMUIE: Massive Multi-Domain Universal Information Extraction for Long Documents
Shuyi Zhang, Zhenbin Chen, Shuting Li et al.
mmWEAVER: Environment-Specific mmWave Signal Synthesis from a Photo and Activity Description
Mahathir Monjur, Shahriar Nirjon
Mnemis: Dual-Route Retrieval on Hierarchical Graphs for Long-Term LLM Memory
Zihao Tang, Xin Yu, Ziyu Xiao et al.
Mnemosyne: Accelerating Multi-Hop Question Answering via Cache Hit Order Fitting
Haizhou Du, Jiujiu Li, Dongyang Li et al.
MoA: Heterogeneous Mixture of Adapters for Parameter-Efficient Fine-Tuning of Large Language Models
Jie Cao, Tianwei Lin, Bo Yuan et al.
MoA: Mixture of Aggregators Improves Slide-Level Diagnosis in Computational Pathology
Fatih Ozlugedik, Muhammed Furkan Dasdelen, Rao Muhammad Umer et al.
MOA: Multi-Objective Alignment for Role-Playing Agents
Chonghua Liao, Ke Wang, Yuchuan Wu et al.
MOBA: A Material-Oriented Backdoor Attack Against LiDAR-Based 3D Object Detection Systems
Saket Sanjeev Chaturvedi, Gaurav Bagwe, Lan Emily Zhang et al.
MoBGS: Motion Deblurring Dynamic 3D Gaussian Splatting for Blurry Monocular Video
Minh-Quan Viet Bui, Jongmin Park, Juan Luis Gonzalez et al.
Mobile-Agent-RAG: Driving Smart Multi-Agent Coordination with Contextual Knowledge Empowerment for Long-Horizon Mobile Automation
Yuxiang Zhou, Jichang Li, Yanhao Zhang et al.
MobileCity: An Efficient Framework for Large-Scale Urban Behavior Simulation
Xiaotong Ye, Nicolas Bougie, Toshihiko Yamasaki et al.
MobileLLM-Flash: Latency-Guided On-Device LLM Design for Industry Scale Deployment
Hanxian Huang, Igor Fedorov, Andrey Gromov et al.
Mobile-Oriented Video Diffusion: Enabling Text-to-Video Generation on Mobile Devices Without Retraining, Compression, or Pruning
Bosung Kim, Kyuhwan Lee, Isu Jeong et al.
Mobile-R1: Towards Interactive Capability for VLM-Based Mobile Agent via Systematic Training
Jihao Gu, Qihang Ai, Yingyao Wang et al.
MobileSafetyBench: Evaluating Safety of Autonomous Agents in Mobile Device Control
Juyong Lee, Dongyoon Hahm, June Suk Choi et al.
MobileWorld: Benchmarking Autonomous Mobile Agents in Agent-User Interactive and MCP-Augmented Environments
Quyu Kong, Xu Zhang, Zhenyu Yang et al.