Dialogue is the Plan: From Interface to Joint Action in Agentic AI
Abstract
AbstractLarge Language Model agents can seeminglyplan and act, yet their language use is oftentreated primarily as an interface for instructingactions and reporting results. We argue that thisframing is one important cause of recurrent coordination failures in human-facing and multiagent settings, including ungrounded assumptions, silent goal misalignment, brittle protocoladherence, and failures to maintain or updateshared dialogue state over time, a limitation previously linked to the absence of explicit common ground tracking in collaborative systems(Geib et al., 2022). Drawing from classical dialogue system research on joint action, commonground, grounding, repair, and incremental processing, we re-frame dialogue as part of theplanning loop itself (rather than its output). Wedistill this re-framing into concrete implicationsfor agentic architecture and evaluation, including explicit representations of shared commitments, clarification as a first class action available to the policy, and process metrics that approximate grounding behavior, repair, and commitment formation rather than task completionalone. We lastly discuss how dialogue-centeredrequirements can inform standards and governance for safe deployment of agentic systems.