Research Explorer
Papers
Trends
Conferences
Explore
Authors
Topics
Keywords
Papers
Trends
Conferences
Explore
Authors
Topics
Keywords
Achievements
About
Methodology
← Core AI
Artificial Intelligence
›
Core AI
›
Interpretability
7318 directly classified papers
Papers per year
2003: 1
2006: 1
2007: 1
2008: 1
2009: 1
2010: 5
2012: 2
2013: 10
2014: 7
2015: 14
2016: 27
2017: 84
2018: 196
2019: 395
2020: 488
2021: 771
2022: 823
2023: 954
2024: 1360
2025: 1713
2026: 464
Papers
RPMIL: Rethinking Uncertainty-Aware Probabilistic Multiple Instance Learning for Whole Slide Pathology Diagnosis
IJCAI 2025
Interpretable Mnemonic Generation for Kanji Learning via Expectation-Maximization
EMNLP 2025
Sparse Activation Editing for Reliable Instruction Following in Narratives
EMNLP 2025
Morables: A Benchmark for Assessing Abstract Moral Reasoning in LLMs with Fables
EMNLP 2025
Self-calibration Enhanced Whole Slide Pathology Image Analysis
IJCAI 2025
GraphProt: Certified Black-Box Shielding Against Backdoored Graph Models
IJCAI 2025
Pixels Versus Priors: Controlling Knowledge Priors in Vision-Language Models through Visual Counterfacts
EMNLP 2025
Exploring Supervised Approaches to the Detection of Anthropomorphic Language in the Reporting of NLP Venues
ACL 2025
AdaSteer: Your Aligned LLM is Inherently an Adaptive Jailbreak Defender
EMNLP 2025
A Conformal Risk Control Framework for Granular Word Assessment and Uncertainty Calibration of CLIPScore Quality Estimates
ACL 2025
Critic-CoT: Boosting the Reasoning Abilities of Large Language Model via Chain-of-Thought Critic
ACL 2025
Improving Large Language Models Function Calling and Interpretability via Guided-Structured Templates
EMNLP 2025
Hidden in Plain Sight: Reasoning in Underspecified and Misspecified Scenarios for Multimodal LLMs
EMNLP 2025
TactfulToM: Do LLMs have the Theory of Mind ability to understand White Lies?
EMNLP 2025
Unsupervised Automatic Short Answer Grading and Essay Scoring: A Weakly Supervised Explainable Approach
ACL 2025
SmurfCat at SemEval-2025 Task 3: Bridging External Knowledge and Model Uncertainty for Enhanced Hallucination Detection
ACL 2025
HalluSearch at SemEval-2025 Task 3: A Search-Enhanced RAG Pipeline for Hallucination Detection
ACL 2025
CoKe: Customizable Fine-Grained Story Evaluation via Chain-of-Keyword Rationalization
ACL 2025
Com2 : A Causal-Guided Benchmark for Exploring Complex Commonsense Reasoning in Large Language Models
ACL 2025
We-Math: Does Your Large Multimodal Model Achieve Human-like Mathematical Reasoning?
ACL 2025
Rubrik’s Cube: Testing a New Rubric for Evaluating Explanations on the CUBE dataset
ACL 2025
On the Consistency of Commonsense in Large Language Models
ACL 2025
STRICTA: Structured Reasoning in Critical Text Assessment for Peer Review and Beyond
ACL 2025
Understanding the Dark Side of LLMs’ Intrinsic Self-Correction
ACL 2025
Stochastic Chameleons: Irrelevant Context Hallucinations Reveal Class-Based (Mis)Generalization in LLMs
ACL 2025
<
1
…
19
20
21
…
293
>