Papers
MAU-GPT: Enhancing Multi-type Industrial Anomaly Understanding via Anomaly-aware and Generalist Experts Adaptation
Zhuonan Wang, Zhenxuan Fan, Siwen Tan et al.
MAVERIX: Multimodal Audio-Visual Evaluation and Recognition IndeX
Liuyue Xie, Avik Kuthiala, George Z Wei et al.
MAVIS: A Benchmark for Multimodal Source Attribution in Long-form Visual Question Answering
Seokwon Song, Minsu Park, Gunhee Kim
MAViS: A Multi-Agent Framework for Long-Sequence Video Storytelling
Qian Wang, Ziqi Huang, Ruoxi Jia et al.
Maximizing Schatten-p Norm Regularization Toward Balance
Fangfang Li, Quanxue Gao, Yapeng Wang et al.
MaxInfo: A Training-Free Key-Frame Selection Method Using Maximum Volume for Enhanced Video Understanding
Pengyi Li, Irina Abdullaeva, Alexander Gambashidze et al.
MBTI: Metric-Based Textual Inversion for Fine-Grained Image Generation
Byungkwan Chae, Youngjae Choi, Heewon Kim
MBZUAI at AMIYA Shared Task 2026: Adapting Open-Source LLMs for Dialectal Arabic
Rana Gaber, Yara Allam, Serag Amin et al.
MCA-Bench: A Multimodal Benchmark for Evaluating CAPTCHA Robustness Against VLM-based Attacks
Zonglin Wu, Yule Xue, Yaoyao Feng et al.
MCGS: Markov Chain Gaussian Splatting for Dynamic Scenes Reconstruction
Yuzhong Wang, Wenmin Wang, Shixiong Zhang et al.
MCIE: Multimodal LLM-Driven Complex Instruction Image Editing with Spatial Guidance
Xuehai Bai, Xiaoling Gu, Akide Liu et al.
MCI-Net: A Robust Multi-Domain Context Integration Network for Point Cloud Registration
Shuyuan Lin, Wenwu Peng, Junjie Huang et al.
McMining: Automated Discovery of Misconceptions in Student Code
Erfan Al-Hossami, Razvan Bunescu
MCMoE: Completing Missing Modalities with Mixture of Experts for Incomplete Multimodal Action Quality Assessment
Huangbiao Xu, Huanqi Wu, Xiao Ke et al.
MCP-AgentBench: Evaluating Real-World Language Agent Performance with MCP-Mediated Tools
Zikang Guo, Benfeng Xu, Chiwei Zhu et al.
MCPTox: A Benchmark for Tool Poisoning on Real-World MCP Servers
Zhiqiang Wang, Yichao Gao, Yanting Wang et al.
MCTSr-Zero: Self-Reflective Psychological Counseling Dialogues Generation via Principles and Adaptive Exploration
Hao Lu, Yanchi Gu, Haoyuan Huang et al.
MCTS-SQL: Light-Weight LLMs Can Master the Text-to-SQL Through Monte Carlo Tree Search
Shuozhi Yuan, Liming Chen, Miaomiao Yuan et al.
MCW-KD: Multi-Cost Wasserstein Knowledge Distillation for Large Language Models
Hoang Tran Vuong, Tue Le, Quyen Tran et al.
MdaIF: Robust One-Stop Multi-Degradation-Aware Image Fusion with Language-Driven Semantics
Jing Li, Yifan Wang, Jiafeng Yan et al.
MDBench: Benchmarking Data-Driven Methods for Model Discovery
Amirmohammad Ziaei Bideh, Aleksandra Georgievska, Jonathan Gryak
MDF: A Modality-Aware Disentanglement and Fusion Framework for Multimodal Sentiment Analysis
Zhongquan Jian, Wenhan Lv, Yanhao Chen et al.
MDiff4STR: Mask Diffusion Model for Scene Text Recognition
Yongkun Du, Miaomiao Zhao, Songlin Fan et al.
MDK12-Bench: A Multi-Discipline Benchmark for Evaluating Reasoning in Multimodal Large Language Models
Pengfei Zhou, Xiaopeng Peng, Fanrui Zhang et al.
MDMLP-EIA: Multi-domain Dynamic MLPs with Energy Invariant Attention for Time Series Forecasting
Hu Zhang, Zhien Dai, Zhaohui Tang et al.