conftrace
_
Papers
Trends
Conferences
Explore
Authors
Papers
Trends
Conferences
Explore
Authors
Topics
Keywords
Achievements
← Core AI
Artificial Intelligence
›
Core AI
›
Interpretability
7,318 papers
Papers per year
2003: 1
2006: 1
2007: 1
2008: 1
2009: 1
2010: 5
2012: 2
2013: 10
2014: 7
2015: 14
2016: 27
2017: 84
2018: 196
2019: 395
2020: 488
2021: 771
2022: 823
2023: 954
2024: 1360
2025: 1713
2026: 464
Papers
Benchmarking Heterogeneous Treatment Effect Models through the Lens of Interpretability
NIPS 2022
Decision Trees with Short Explainable Rules
NIPS 2022
RKHS-SHAP: Shapley Values for Kernel Methods
NIPS 2022
Repairing Neural Networks by Leaving the Right Past Behind
NIPS 2022
Constrained Predictive Coding as a Biologically Plausible Model of the Cortical Hierarchy
NIPS 2022
Washing The Unwashable : On The (Im)possibility of Fairwashing Detection
NIPS 2022
Where do Models go Wrong? Parameter-Space Saliency Maps for Explainability
NIPS 2022
Task Discovery: Finding the Tasks that Neural Networks Generalize on
NIPS 2022
OpenXAI: Towards a Transparent Evaluation of Model Explanations
NIPS 2022
Normalizing Flows for Knockoff-free Controlled Feature Selection
NIPS 2022
Inherently Explainable Reinforcement Learning in Natural Language
NIPS 2022
Neural Topological Ordering for Computation Graphs
NIPS 2022
CEBaB: Estimating the Causal Effects of Real-World Concepts on NLP Model Behavior
NIPS 2022
Learning to Generate Inversion-Resistant Model Explanations
NIPS 2022
If Influence Functions are the Answer, Then What is the Question?
NIPS 2022
Probing Classifiers are Unreliable for Concept Removal and Detection
NIPS 2022
SkinCon: A skin disease dataset densely annotated by domain experts for fine-grained debugging and analysis
NIPS 2022
Convergent Representations of Computer Programs in Human and Artificial Neural Networks
NIPS 2022
GStarX: Explaining Graph Neural Networks with Structure-Aware Cooperative Games
NIPS 2022
AutoMS: Automatic Model Selection for Novelty Detection with Error Rate Control
NIPS 2022
Chroma-VAE: Mitigating Shortcut Learning with Generative Classifiers
NIPS 2022
Exploiting the Relationship Between Kendall's Rank Correlation and Cosine Similarity for Attribution Protection
NIPS 2022
Additive MIL: Intrinsically Interpretable Multiple Instance Learning for Pathology
NIPS 2022
Boosting Out-of-distribution Detection with Typical Features
NIPS 2022
Evaluating Latent Space Robustness and Uncertainty of EEG-ML Models under Realistic Distribution Shifts
NIPS 2022
<
1
…
181
182
183
…
293
>