Artificial Intelligence › Core AI ›

Interpretability

7318 directly classified papers

Papers per year

Papers

The Confidence Paradox: Can LLM Know When It’s Wrong? IJCNLP 2025

The Visual Counter Turing Test (VCT²): A Benchmark for Evaluating AI-Generated Image Detection and the Visual AI Index (V_AI) IJCNLP 2025

Think Twice: Test-Time Reasoning for Robust CLIP Zero-Shot Classification ICCV 2025

On the Consistency of Video Large Language Models in Temporal Comprehension CVPR 2025

ProofTeller: Exposing recency bias in LLM reasoning and its side effects on communication IJCNLP 2025

SciHallu: A Multi-Granularity Hallucination Detection Dataset for Scientific Writing IJCNLP 2025

SSA: Semantic Contamination of LLM-Driven Fake News Detection EMNLP 2025

Fine-grained Confidence Estimation for Spurious Correctness Detection in Large Language Models IJCNLP 2025

Interpreting Multi-Attribute Confounding through Numerical Attributes in Large Language Models IJCNLP 2025

Soft Local Completeness: Rethinking Completeness in XAI ICCV 2025

FedXDS: Leveraging Model Attribution Methods to counteract Data Heterogeneity in Federated Learning ICCV 2025

VERA: Explainable Video Anomaly Detection via Verbalized Learning of Vision-Language Models CVPR 2025

Critical Thinking: Which Kinds of Complexity Govern Optimal Reasoning Length? IJCNLP 2025

How Reliable are Causal Probing Interventions? IJCNLP 2025

Verbalized Representation Learning for Interpretable Few-Shot Generalization ICCV 2025

Explainable Ethical Assessment on Human Behaviors by Generating Conflicting Social Norms IJCNLP 2025

HEARTS: A Holistic Framework for Explainable, Sustainable and Robust Text Stereotype Detection IJCNLP 2025

Riemannian-Geometric Fingerprints of Generative Models ICCV 2025

GCAV: A Global Concept Activation Vector Framework for Cross-Layer Consistency in Interpretability ICCV 2025

Language Models Identify Ambiguities and Exploit Loopholes EMNLP 2025

Discursive Circuits: How Do Language Models Understand Discourse Relations? EMNLP 2025

Threading the Needle: Reweaving Chain-of-Thought Reasoning to Explain Human Label Variation EMNLP 2025

Evaluating Taxonomy Free Character Role Labeling (TF-CRL) in News Stories using Large Language Models EMNLP 2025

Neuron-Level Differentiation of Memorization and Generalization in Large Language Models EMNLP 2025

Semi-supervised Concept Bottleneck Models ICCV 2025