Nature-Inspired Population-Based Evolution of Large Language Models

Yiqun Zhang; Peng Ye; Xiaocui Yang; Shi Feng; Shufei Zhang; LEI BAI; Wanli Ouyang; Shuyue Hu

ACL 2026

Abstract

Evolution, the engine behind the survival and growth of life on Earth, operates through the population-based process of reproduction. Inspired by this principle, this paper formally defines a newly emerging problem: the population-based evolution of large language models (LLMs). We introduce a novel framework that starts with a population of parent LLMs and allows this population to evolve through four key operations: (i) crossover, merging the weights of different parents to create offspring LLMs, (ii) mutation, introducing small, random changes to model weights to foster diversity, (iii) selection, prioritizing high-performing models, and (iv) succession, transferring the learned experience from parent to offspring LLMs. With only 200 samples per new task, the LLM population evolves rapidly to adapt to the task at hand, without any gradients. Experiments on 12 datasets show that our framework consistently outperforms existing multi-LLM merging and adaptation methods, achieving relative performance gains of up to 54.8% over the best LLM in the initial population. Moreover, our framework supports (i) evolving LLMs across multiple new tasks simultaneously, (ii) scaling effectively to populations of up to 40 LLMs, and (iii) zero-shot generalization to unseen held-out tasks. Code: https://github.com/ZhangYiqun018/GENOME
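The four operations named in the abstract can be illustrated with a minimal, gradient-free sketch. This is not the authors' implementation (see the linked repository for that). It assumes toy weight vectors in place of LLM parameters, a made-up `fitness` function in place of accuracy on the 200 task samples, linear interpolation as the merging rule, and a simple pull toward the current best individual as a stand-in for succession. All function names and parameters are hypothetical.

```python
import random

def fitness(weights, target):
    # Toy stand-in for task performance on a small sample set (higher is better).
    return -sum((w - t) ** 2 for w, t in zip(weights, target))

def crossover(a, b, rng):
    # Assumed merging rule: linear interpolation of the two parents' weights.
    alpha = rng.random()
    return [alpha * x + (1 - alpha) * y for x, y in zip(a, b)]

def mutate(weights, rng, sigma=0.05):
    # Small Gaussian perturbation of every weight to keep the population diverse.
    return [w + rng.gauss(0.0, sigma) for w in weights]

def succeed(child, elite, rate=0.2):
    # Assumed form of succession: move the child part of the way toward
    # the best weights currently in the population.
    return [c + rate * (e - c) for c, e in zip(child, elite)]

def evolve(population, target, generations=30, seed=0):
    rng = random.Random(seed)
    size = len(population)
    score = lambda w: fitness(w, target)
    for _ in range(generations):
        elite = max(population, key=score)
        children = []
        for _ in range(size):
            a, b = rng.sample(population, 2)
            child = mutate(crossover(a, b, rng), rng)
            children.append(succeed(child, elite))
        # Selection: keep the best `size` individuals among parents and children.
        population = sorted(population + children, key=score, reverse=True)[:size]
    return population
```

A possible usage, again with made-up values: build six random parent vectors with `random.Random(1)`, call `evolve(parents, target)`, and compare the best `fitness` before and after. Because selection keeps parents in the candidate pool, the best score in this sketch never decreases from one generation to the next.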

Authors

Yiqun Zhang, Peng Ye, Xiaocui Yang, Shi Feng, Shufei Zhang, LEI BAI, Wanli Ouyang, Shuyue Hu

Topics

Artificial Intelligence > Core AI > Large Language Models
Deep Learning > Learning Types > Model Merging
Machine Learning > Learning Types > Evolutionary Algorithm

Keywords

model merging, evolutionary algorithm, large language model, population evolution
