chain-of-thought reasoning

469 papers

Explore in graph

Also known as

LONG-COT LCOT XCOT COT

Co-occurring keywords

large language model (12755) reinforcement learning (4122) question answering (2904) multimodal large language model (865) multimodal learning (4622) mathematical reasoning (355) chain of thought (274) knowledge distillation (3680) vision-language model (2235) prompt engineering (1128)

Papers

Trade-offs in Large Reasoning Models: An Empirical Analysis of Deliberative and Adaptive Reasoning over Foundational Capabilities AAAI 2026

STaR: Sensitive Trajectory Regulation for Unlearning in Large Reasoning Models AAAI 2026

CoT-VLNBench: A Benchmark for Visual Chain-of-Thought Reasoning in Vision-Language-Navigation Robots AAAI 2026

Chain-of-Thought Driven Adversarial Scenario Extrapolation for Robust Language Models AAAI 2026

BLM-Guard: Explainable Multimodal Ad Moderation with Chain-of-Thought and Policy-Aligned Rewards AAAI 2026

CRAF: A Clinical Reasoning-Adaptive Framework via Reinforcement Learning for Similar Case Retrieval AAAI 2026

Patho-R1: A Multimodal Reinforcement Learning-Based Pathology Expert Reasoner AAAI 2026

GeM-VG: Towards Generalized Multi-image Visual Grounding with Multimodal Large Language Models AAAI 2026

GraphCoT-VLA: A 3D Spatial-Aware Reasoning Vision-Language-Action Model for Robotic Manipulation with Ambiguous Instructions AAAI 2026

MedEyes: Learning Dynamic Visual Focus for Medical Progressive Diagnosis AAAI 2026

TraveLLaMA: A Multimodal Travel Assistant with Large-Scale Dataset and Structured Reasoning AAAI 2026

U-MIRAGE: Benchmarking Chain-of-Thought Reasoning for Urdu Medical QA EACL 2026

ST-Think: How Multimodal Large Language Models Reason About 4D Worlds from Ego-Centric Videos WACV 2026

FAST-EQA: Efficient Embodied Question Answering with Global and Local Region Relevancy WACV 2026

Incentivizing Strong Reasoning from Weak Supervision EACL 2026

Snap Out of It: A Dual-Process Approach to Mitigating Overthinking in Language Model Reasoning ACL 2025

RedHit: Adaptive Red-Teaming of Large Language Models via Search, Reasoning, and Preference Optimization ACL 2025

Sparks of Tabular Reasoning via Text2SQL Reinforcement Learning ACL 2025

LIMICS at ArchEHR-QA 2025: Prompting LLMs Beats Fine-Tuned Embeddings ACL 2025

IntentionESC: An Intention-Centered Framework for Enhancing Emotional Support in Dialogue Systems ACL 2025

InspireDebate: Multi-Dimensional Subjective-Objective Evaluation-Guided Reasoning and Optimization for Debating ACL 2025

Understanding the Thinking Process of Reasoning Models: A Perspective from Schoenfeld’s Episode Theory EMNLP 2025

Bootstrapping Grounded Chain-of-Thought in Multimodal LLMs for Data-Efficient Model Adaptation ICCV 2025

CoTMR: Chain-of-Thought Multi-Scale Reasoning for Training-Free Zero-Shot Composed Image Retrieval ICCV 2025

STaR-SQL: Self-Taught Reasoner for Text-to-SQL ACL 2025