Papers
MMHOI: Modeling Complex 3D Multi-Human Multi-Object Interactions
Kaen Kogashi, Anoop Cherian, Meng-Yu Jennifer Kuo
MMhops-R1: Multimodal Multi-hop Reasoning
Tao Zhang, Ziqi Zhang, Zongyang Ma et al.
MMIFEvol: Towards Evolutionary Multimodal Instruction Following
Haoyu Wang, Sihang Jiang, Xiangru Zhu et al.
M-MiniGPT4: Multilingual VLLM Alignment via Translated Data
Seung Hun Eddie Han, Youssef Mohamed, Mohamed Elhoseiny
mmJEPA-ECG: Cross-Posture Robust Contactless Electrocardiogram Monitoring via Millimeter Wave Radar Sensing
Ziyang Liu, Siyuan He, Feng Liang et al.
MM-JudgeBias: A Benchmark for Evaluating Compositional Biases in MLLM-as-a-Judge
Sua Lee, Sanghee Park, Jinbae Im
MMMamba: A Versatile Cross-Modal in Context Fusion Framework for Pan-Sharpening and Zero-Shot Image Enhancement
Yingying Wang, Xuanhua He, Chen Wu et al.
MMPG: MoE-based Adaptive Multi-Perspective Graph Fusion for Protein Representation Learning
Yusong Wang, Jialun Shen, Zhihao Wu et al.
MM-PoisonRAG: Disrupting Multimodal RAG with Local and Global Knowledge Poisoning Attacks
Hyeonjeong Ha, Qiusi Zhan, Jeonghwan Kim et al.
mmPred: Radar-based Human Motion Prediction in the Dark
Junqiao Fan, Haocong Rao, Jiarui Zhang et al.
MM-R1: Unleashing the Power of Unified Multimodal Large Language Models for Personalized Image Generation
Qian Liang, Yujia Wu, Kuncheng Li et al.
MMRA: A Benchmark for Evaluating Multi-Granularity and Multi-Image Relational Association Capabilities in Large Visual Language Models
Siwei Wu, King Zhu, Yu Bai et al.
MMRAG-RFT: Two-stage Reinforcement Fine-tuning for Explainable Multi-modal Retrieval-augmented Generation
Shengwei Zhao, Jingwen Yao, Sitong Wei et al.
MMSciCode: Real-world Evaluation of Multilingual Multi-Discipline Scientific Research Coding
Xue Xia, Zheyuan Yang, Arman Cohan et al.
MMSearch-R1: Incentivizing LMMs to Search
Jinming Wu, Zihao Deng, Wei Li et al.
MM-StanceDet: Retrieval-Augmented Multi-modal Multi-agent Stance Detection
Weihai Lu, Zhejun Zhao, Yanshu Li et al.
MM-TS: Multi-Modal Temperature and Margin Schedules for Contrastive Learning with Long-Tail Data
Siarhei Sheludzko, Dhimitrios Duka, Bernt Schiele et al.
MMTutorBench: The First Multimodal Benchmark for AI Math Tutoring
Tengchao Yang, Sichen Guo, Mengzhao Jia et al.
MMUIE: Massive Multi-Domain Universal Information Extraction for Long Documents
Shuyi Zhang, Zhenbin Chen, Shuting Li et al.
mmWEAVER: Environment-Specific mmWave Signal Synthesis from a Photo and Activity Description
Mahathir Monjur, Shahriar Nirjon
Mnemis: Dual-Route Retrieval on Hierarchical Graphs for Long-Term LLM Memory
Zihao Tang, Xin Yu, Ziyu Xiao et al.
Mnemosyne: Accelerating Multi-Hop Question Answering via Cache Hit Order Fitting
Haizhou Du, Jiujiu Li, Dongyang Li et al.
MoA: Heterogeneous Mixture of Adapters for Parameter-Efficient Fine-Tuning of Large Language Models
Jie Cao, Tianwei Lin, Bo Yuan et al.
MoA: Mixture of Aggregators Improves Slide-Level Diagnosis in Computational Pathology
Fatih Ozlugedik, Muhammed Furkan Dasdelen, Rao Muhammad Umer et al.
MOA: Multi-Objective Alignment for Role-Playing Agents
Chonghua Liao, Ke Wang, Yuchuan Wu et al.