logical reasoning

298 papers

Explore in graph

Co-occurring keywords

large language model (12755) natural language inference (1278) benchmark evaluation (1539) knowledge graph (1795) first-order logic (168) question answering (2904) symbolic reasoning (184) language model (4573) deductive reasoning (56) neural network (6616)

Papers

PlanningArena: A Modular Benchmark for Multidimensional Evaluation of Planning and Tool Learning ACL 2025

Program Synthesis Benchmark for Visual Programming in XLogoOnline Environment ACL 2025

Unravelling the Logic: Investigating the Generalisation of Transformers in Numerical Satisfiability Problems ACL 2025

RuleArena: A Benchmark for Rule-Guided Reasoning with LLMs in Real-World Scenarios ACL 2025

LogicPro: Improving Complex Logical Reasoning via Program-Guided Learning ACL 2025

Harnessing Large Language Models for Knowledge Graph Question Answering via Adaptive Multi-Aspect Retrieval-Augmentation AAAI 2025

Reasoning is All You Need for Video Generalization: A Counterfactual Benchmark with Sub-question Evaluation ACL 2025

Assessing the Sensitivity and Alignment of FOL Closeness Metrics EMNLP 2025

LTRAG: Enhancing Autoformalization and Self-refinement for Logical Reasoning with Thought-Guided RAG ACL 2025

Dissecting Logical Reasoning in LLMs: A Fine-Grained Evaluation and Supervision Study EMNLP 2025

Enhancing Complex Reasoning in Knowledge Graph Question Answering through Query Graph Approximation ACL 2025

DivLogicEval: A Framework for Benchmarking Logical Reasoning Evaluation in Large Language Models EMNLP 2025

Structured Knowledge meets GenAI: A Framework for Logic-Driven Language Models COLING 2025

MultiLogicNMR(er): A Benchmark and Neural-Symbolic Framework for Non-monotonic Reasoning with Multiple Extensions EMNLP 2025

Do Large Language Models excel in Complex Logical Reasoning with Formal Language? EMNLP 2025

On Memorization of Large Language Models in Logical Reasoning IJCNLP 2025

LogiGraph: Logical Reasoning with Contrastive Learning and Lightweight Graph Networks COLING 2025

Probing Logical Reasoning of MLLMs in Scientific Diagrams EMNLP 2025

LogicTree: Structured Proof Exploration for Coherent and Rigorous Logical Reasoning with Large Language Models EMNLP 2025

Semantic Inversion, Identical Replies: Revisiting Negation Blindness in Large Language Models EMNLP 2025

Order Doesn’t Matter, But Reasoning Does: Training LLMs with Order-Centric Augmentation EMNLP 2025

Enhancing Logical Reasoning in Language Models via Symbolically-Guided Monte Carlo Process Supervision EMNLP 2025

SATBench: Benchmarking LLMs’ Logical Reasoning via Automated Puzzle Generation from SAT Formulas EMNLP 2025

Quantifying Logical Consistency in Transformers via Query-Key Alignment EMNLP 2025

Can Large Language Models Win the International Mathematical Games? EMNLP 2025