Research Explorer
Interpretability
7318 directly classified papers
Papers per year
2003: 1
2006: 1
2007: 1
2008: 1
2009: 1
2010: 5
2012: 2
2013: 10
2014: 7
2015: 14
2016: 27
2017: 84
2018: 196
2019: 395
2020: 488
2021: 771
2022: 823
2023: 954
2024: 1360
2025: 1713
2026: 464
Papers
The Confidence Paradox: Can LLM Know When It’s Wrong?
IJCNLP 2025
The Visual Counter Turing Test (VCT²): A Benchmark for Evaluating AI-Generated Image Detection and the Visual AI Index (V_AI)
IJCNLP 2025
Think Twice: Test-Time Reasoning for Robust CLIP Zero-Shot Classification
ICCV 2025
On the Consistency of Video Large Language Models in Temporal Comprehension
CVPR 2025
ProofTeller: Exposing recency bias in LLM reasoning and its side effects on communication
IJCNLP 2025
SciHallu: A Multi-Granularity Hallucination Detection Dataset for Scientific Writing
IJCNLP 2025
SSA: Semantic Contamination of LLM-Driven Fake News Detection
EMNLP 2025
Fine-grained Confidence Estimation for Spurious Correctness Detection in Large Language Models
IJCNLP 2025
Interpreting Multi-Attribute Confounding through Numerical Attributes in Large Language Models
IJCNLP 2025
Soft Local Completeness: Rethinking Completeness in XAI
ICCV 2025
FedXDS: Leveraging Model Attribution Methods to counteract Data Heterogeneity in Federated Learning
ICCV 2025
VERA: Explainable Video Anomaly Detection via Verbalized Learning of Vision-Language Models
CVPR 2025
Critical Thinking: Which Kinds of Complexity Govern Optimal Reasoning Length?
IJCNLP 2025
How Reliable are Causal Probing Interventions?
IJCNLP 2025
Verbalized Representation Learning for Interpretable Few-Shot Generalization
ICCV 2025
Explainable Ethical Assessment on Human Behaviors by Generating Conflicting Social Norms
IJCNLP 2025
HEARTS: A Holistic Framework for Explainable, Sustainable and Robust Text Stereotype Detection
IJCNLP 2025
Riemannian-Geometric Fingerprints of Generative Models
ICCV 2025
GCAV: A Global Concept Activation Vector Framework for Cross-Layer Consistency in Interpretability
ICCV 2025
Language Models Identify Ambiguities and Exploit Loopholes
EMNLP 2025
Discursive Circuits: How Do Language Models Understand Discourse Relations?
EMNLP 2025
Threading the Needle: Reweaving Chain-of-Thought Reasoning to Explain Human Label Variation
EMNLP 2025
Evaluating Taxonomy Free Character Role Labeling (TF-CRL) in News Stories using Large Language Models
EMNLP 2025
Neuron-Level Differentiation of Memorization and Generalization in Large Language Models
EMNLP 2025
Semi-supervised Concept Bottleneck Models
ICCV 2025