Immediate Inference: The Missing Foundation in Large Language Model Logical Reasoning

Sihang Jiang; Zhiyu Lu; Keyi Wang; Jiaqing Liang; Yanghua Xiao; Xiaojun Meng; Jiansheng Wei

2026 ACL ACL 2026

Immediate Inference: The Missing Foundation in Large Language Model Logical Reasoning

Abstract

AbstractWhile extensive research has evaluated LLMs on complex reasoning tasks, the foundational building blocks of logical reasoning remain underexplored. We introduce IIBench, a benchmark evaluating immediate inference (elementary operations over categorical propositions). Our evaluation reveals that even SoTA models exhibit systematic deficiencies in immediate inference, and establishes immediate inference as foundational: it mediates approximately 40% of the effect on syllogistic reasoning, with near-perfect correlation ( = 0.98) across reasoning benchmarks. Our analysis reveals that models lack robust operator grounding, oscillating between structural reasoning and surface pattern matching with inconsistent handling of quantifiers and negation.

Authors

Sihang Jiang , Zhiyu Lu , Keyi Wang , Jiaqing Liang , Yanghua Xiao , Xiaojun Meng , Jiansheng Wei

Topics

Artificial Intelligence > Core AI > Large Language Models Artificial Intelligence > Core AI > Reasoning Artificial Intelligence > Core AI > Evaluation

Keywords

logical reasoning syllogistic reasoning immediate inference categorical proposition operator grounding

Download PDF

Related papers

No Reader Left Behind: Multi-Agent Summaries Everyone Can Understand 2026

One-step Nonautoregressive Natural Language Generation with Shortcut Flow Matching Models 2026

Optimizing Retrieval-Augmented Generation for E-Commerce How-To Assistance 2026

Make Mechanistic Interpretability Auditable: A Call to Develop Guidelines via Continuous Collaborative Reviewing 2026

MQM Re-Annotation: A Technique for Collaborative Evaluation of Machine Translation 2026