How Memory Management Impacts LLM Agents: An Empirical Study of Experience-Following Behavior

Zidi Xiong; Yuping Lin; Wenya Xie; Pengfei He; Zirui Liu; Jiliang Tang; Himabindu Lakkaraju; Zhen Xiang

2026 ACL ACL 2026

How Memory Management Impacts LLM Agents: An Empirical Study of Experience-Following Behavior

Abstract

AbstractMemory is a critical component in large language model (LLM)-based agents, enabling them to store and retrieve past executions to improve task performance over time. In this paper, we conduct an empirical study on how memory management choices impact the LLM agents’ behavior, especially their long-term performance. Specifically, we focus on two fundamental memory management operations that are widely used by many agent frameworks—memory addition and deletion—to systematically study their impact on the agent behavior. Through our quantitative analysis, we find that LLM agents display an *experience-following* property: high similarity between a task input and the input in a retrieved memory record often results in highly similar agent outputs. Our analysis further reveals two significant challenges associated with this property: *error propagation*, where inaccuracies in past experiences compound and degrade future performance, and *misaligned experience replay*, where some seemingly correct executions can provide limited or even misleading value as experiences. Through controlled experiments, we demonstrate the importance of regulating experience quality within the memory bank and show that future task evaluations can serve as free quality labels for stored memory. Our findings offer insights into the behavioral dynamics of LLM agent memory systems and provide practical guidance for designing memory components that support robust, long-term agent performance.

Authors

Zidi Xiong , Yuping Lin , Wenya Xie , Pengfei He , Zirui Liu , Jiliang Tang , Himabindu Lakkaraju , Zhen Xiang

Topics

Artificial Intelligence > Core AI > Agent Systems Artificial Intelligence > Core AI > Memory Artificial Intelligence > Core AI > Large Language Models

Keywords

error propagation experience replay memory management llm agent experience following behavior

Download PDF

Related papers

No Reader Left Behind: Multi-Agent Summaries Everyone Can Understand 2026

One-step Nonautoregressive Natural Language Generation with Shortcut Flow Matching Models 2026

Optimizing Retrieval-Augmented Generation for E-Commerce How-To Assistance 2026

Make Mechanistic Interpretability Auditable: A Call to Develop Guidelines via Continuous Collaborative Reviewing 2026

MQM Re-Annotation: A Technique for Collaborative Evaluation of Machine Translation 2026