Guidelines as Environments: A World Model Approach to Rule Following

Haiqing Li; Wenliang Zhong; Yinhao Wu; Hehuan Ma; Yuzhi Guo; Thao M. Dang; Junzhou Huang

ACL 2026

Abstract

Guideline-following is increasingly important in compliance, customer support, and other regulated workflows, where correctness is defined by explicit rule systems rather than heuristics. Learning to follow guidelines is challenging because guidelines are interdependent: rules can trigger, suppress, or conflict with one another, while locally plausible responses may violate global constraints. Most existing methods treat guidelines as static text and rely on implicit reasoning or deeper decoding, making rule interactions and satisfaction status hard to observe and control. A more feasible approach is to model guideline execution with an explicit state that tracks evolving rule evidence across steps. However, conventional world models are a poor fit: they typically assume privileged feedback or well-defined transition dynamics, assumptions that do not hold when reasoning occurs purely in language space under ambiguous, text-defined constraints. As a solution, we propose RGCWM, a Rule-Grounded Causal World Model that builds an explicit state space from the guideline text itself. RGCWM represents rule applicability and satisfaction as a continuously updated evidence state, externalizes inter-rule dependencies as a causal structure, and plans at inference time by counterfactually evaluating candidate responses under model-estimated state transitions. Experiments show that this shift from implicit text reasoning to state-based reasoning enables stable, controllable execution of complex interacting rules across diverse domains.
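To make the three ingredients named above concrete (an explicit evidence state, externalized inter-rule dependencies, and counterfactual evaluation of candidate responses), the following toy sketch may help. It is not the RGCWM implementation: the `Rule` class, the keyword-based `trigger` and `required` fields, the single-level `suppresses` edges, and the scoring function are all hypothetical stand-ins chosen for illustration, whereas the abstract describes model-estimated state transitions.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Rule:
    """Hypothetical toy rule; keyword matching stands in for a learned estimator."""
    name: str
    trigger: str              # keyword in the context that makes the rule applicable
    required: str             # keyword in a response that counts as satisfying it
    suppresses: tuple = ()    # names of rules switched off when this rule applies


def applicable_rules(rules, context):
    """Return the names of rules that remain active after suppression edges."""
    ctx = context.lower()
    triggered = {r.name for r in rules if r.trigger in ctx}
    # Simplification: suppression is resolved in one pass, not transitively.
    suppressed = {s for r in rules if r.name in triggered for s in r.suppresses}
    return triggered - suppressed


def estimate_state(rules, context, response):
    """Map each active rule to an evidence score (0.0 or 1.0 in this toy version)."""
    active = applicable_rules(rules, context)
    resp = response.lower()
    return {r.name: (1.0 if r.required in resp else 0.0)
            for r in rules if r.name in active}


def plan(rules, context, candidates):
    """Pick the candidate whose estimated state satisfies the largest share of active rules."""
    def score(candidate):
        state = estimate_state(rules, context, candidate)
        return sum(state.values()) / len(state) if state else 1.0
    return max(candidates, key=score)


# Illustrative guideline set (assumed, not from the paper).
rules = [
    Rule("apologize", trigger="complaint", required="sorry"),
    Rule("offer_refund", trigger="refund", required="refund"),
    Rule("escalate_fraud", trigger="fraud", required="escalate",
         suppresses=("offer_refund",)),
]
context = "Customer complaint: suspected fraud, asks for a refund."
candidates = [
    "Sorry about this. We will refund you today.",
    "Sorry about this. We will escalate your case to our security team.",
]

active = applicable_rules(rules, context)
best = plan(rules, context, candidates)
print(sorted(active))
print(best)
```

In this example the fraud rule suppresses the refund rule, so the first candidate, which looks locally plausible, scores lower than the second once the rule interaction is made explicit in the state.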

Authors

Haiqing Li; Wenliang Zhong; Yinhao Wu; Hehuan Ma; Yuzhi Guo; Thao M. Dang; Junzhou Huang

Topics

Artificial Intelligence > Core AI > Causal Inference; Artificial Intelligence > Core AI > Planning; Artificial Intelligence > Core AI > Reasoning

Keywords

causal structure; world model; rule following; state-based reasoning
