Disentangling Reasoning Logic to Resolve Explicit Knowledge Conflicts

Xianda Zheng; Zijian Huang; Meng-Fen Chiang; Jiamou Liu; Yuan Fang; Michael J. Witbrock; Kaiqi Zhao

2026 ACL ACL 2026

Disentangling Reasoning Logic to Resolve Explicit Knowledge Conflicts

Abstract

AbstractExplicit knowledge conflicts, where retrieved contexts contain contradictory information, have become increasingly prevalent as Large Language Models (LLMs) integrate diverse data sources. The core challenge lies in the complexity of entangled narratives and the heterogeneity of conflict cases, which impose excessive demands on the reasoning capabilities of standard models. To address this, we propose Knowledge Conflict Reasoning (KCR), a framework that adjudicates conflicts by structuring the underlying logic. KCR first disentangles conflicting contexts into distinct sets of reasoning traces, utilizing both textual and graph-based representations, to simplify comprehension. It then employs a Reinforcement Learning with Verifiable Rewards (RLVR) paradigm, guiding the model to internalize a reasoning process that maximizes logical consistency while actively suppressing spurious reasoning paths derived from contradictory contexts. Extensive experiments demonstrate that KCR yields substantial improvements: a KCR-enhanced 7B model surpasses the performance of baselines equipped with top-tier closed-source models such as GPT-4o and GPT-5.1.

Authors

Xianda Zheng , Zijian Huang , Meng-Fen Chiang , Jiamou Liu , Yuan Fang , Michael J. Witbrock , Kaiqi Zhao

Topics

Artificial Intelligence > Core AI > Large Language Models Artificial Intelligence > Core AI > Reasoning Deep Learning > Learning Types > Reinforcement Learning

Keywords

reasoning trace large language model reinforcement learning with verifiable reward knowledge conflict reasoning spurious reasoning path

Download PDF

Related papers

No Reader Left Behind: Multi-Agent Summaries Everyone Can Understand 2026

One-step Nonautoregressive Natural Language Generation with Shortcut Flow Matching Models 2026

Optimizing Retrieval-Augmented Generation for E-Commerce How-To Assistance 2026

Make Mechanistic Interpretability Auditable: A Call to Develop Guidelines via Continuous Collaborative Reviewing 2026

MQM Re-Annotation: A Technique for Collaborative Evaluation of Machine Translation 2026