Artificial Intelligence › Core AI ›

Interpretability

7318 directly classified papers

Papers per year

Papers

RPMIL: Rethinking Uncertainty-Aware Probabilistic Multiple Instance Learning for Whole Slide Pathology Diagnosis IJCAI 2025

Interpretable Mnemonic Generation for Kanji Learning via Expectation-Maximization EMNLP 2025

Sparse Activation Editing for Reliable Instruction Following in Narratives EMNLP 2025

Morables: A Benchmark for Assessing Abstract Moral Reasoning in LLMs with Fables EMNLP 2025

Self-calibration Enhanced Whole Slide Pathology Image Analysis IJCAI 2025

GraphProt: Certified Black-Box Shielding Against Backdoored Graph Models IJCAI 2025

Pixels Versus Priors: Controlling Knowledge Priors in Vision-Language Models through Visual Counterfacts EMNLP 2025

Exploring Supervised Approaches to the Detection of Anthropomorphic Language in the Reporting of NLP Venues ACL 2025

AdaSteer: Your Aligned LLM is Inherently an Adaptive Jailbreak Defender EMNLP 2025

A Conformal Risk Control Framework for Granular Word Assessment and Uncertainty Calibration of CLIPScore Quality Estimates ACL 2025

Critic-CoT: Boosting the Reasoning Abilities of Large Language Model via Chain-of-Thought Critic ACL 2025

Improving Large Language Models Function Calling and Interpretability via Guided-Structured Templates EMNLP 2025

Hidden in Plain Sight: Reasoning in Underspecified and Misspecified Scenarios for Multimodal LLMs EMNLP 2025

TactfulToM: Do LLMs have the Theory of Mind ability to understand White Lies? EMNLP 2025

Unsupervised Automatic Short Answer Grading and Essay Scoring: A Weakly Supervised Explainable Approach ACL 2025

SmurfCat at SemEval-2025 Task 3: Bridging External Knowledge and Model Uncertainty for Enhanced Hallucination Detection ACL 2025

HalluSearch at SemEval-2025 Task 3: A Search-Enhanced RAG Pipeline for Hallucination Detection ACL 2025

CoKe: Customizable Fine-Grained Story Evaluation via Chain-of-Keyword Rationalization ACL 2025

Com2 : A Causal-Guided Benchmark for Exploring Complex Commonsense Reasoning in Large Language Models ACL 2025

We-Math: Does Your Large Multimodal Model Achieve Human-like Mathematical Reasoning? ACL 2025

Rubrik’s Cube: Testing a New Rubric for Evaluating Explanations on the CUBE dataset ACL 2025

On the Consistency of Commonsense in Large Language Models ACL 2025

STRICTA: Structured Reasoning in Critical Text Assessment for Peer Review and Beyond ACL 2025

Understanding the Dark Side of LLMs’ Intrinsic Self-Correction ACL 2025

Stochastic Chameleons: Irrelevant Context Hallucinations Reveal Class-Based (Mis)Generalization in LLMs ACL 2025