Counteracting the Matthew Effect in Self-Improvement of LVLMs through Head-Tail Re-balancing

Xin Guo; Zhiheng Xi; Yiwen Ding; Yitao Zhai; Xiaowei Shi; Xunliang Cai; Tao Gui; Qi Zhang; Xuanjing Huang

2026 ACL ACL 2026

Counteracting the Matthew Effect in Self-Improvement of LVLMs through Head-Tail Re-balancing

Abstract

AbstractSelf-improvement has emerged as a mainstream paradigm for advancing the reasoning capabilities of large vision–language models (LVLMs), where models explore and learn from successful trajectories iteratively. However, we identify a critical imbalance during this process: the model readily generates high-quality trajectories for simple queries (i.e., head data) but struggles with complex ones (i.e., tail data). This bias drives the optimization to disproportionately prioritize simple reasoning skills, while inhibiting the acquisition of complex capabilities. As iterations progress, this imbalance becomes more acute—a dynamic we term the "Matthew effect", ultimately stalling performance gains. To mitigate this, we approach head-tail re-balance during the exploration-and-learning process from two perspectives: distribution-reshaping and trajectory-resampling. Extensive experiments on Qwen2-VL-7B-Instruct and InternVL2.5-4B models across visual reasoning tasks demonstrate that our methods consistently improve visual reasoning capabilities, outperforming vanilla self-improvement baselines by an average of 3.86 points.

Authors

Xin Guo , Zhiheng Xi , Yiwen Ding , Yitao Zhai , Xiaowei Shi , Xunliang Cai , Tao Gui , Qi Zhang , Xuanjing Huang

Topics

Artificial Intelligence > Core AI > Multimodal Learning Deep Learning > Learning Types > Self-Supervised Learning Artificial Intelligence > Core AI > Vision-Language Models

Keywords

visual reasoning vision language model matthew effect self improvement head tail re-balancing

Download PDF

Related papers

No Reader Left Behind: Multi-Agent Summaries Everyone Can Understand 2026

One-step Nonautoregressive Natural Language Generation with Shortcut Flow Matching Models 2026

Optimizing Retrieval-Augmented Generation for E-Commerce How-To Assistance 2026

Make Mechanistic Interpretability Auditable: A Call to Develop Guidelines via Continuous Collaborative Reviewing 2026

MQM Re-Annotation: A Technique for Collaborative Evaluation of Machine Translation 2026