conftrace
_
Papers
Trends
Conferences
Explore
Authors
Papers
Trends
Conferences
Explore
Authors
Topics
Keywords
Achievements
← Core AI
Artificial Intelligence
›
Core AI
›
Interpretability
7,318 papers
Papers per year
2003: 1
2006: 1
2007: 1
2008: 1
2009: 1
2010: 5
2012: 2
2013: 10
2014: 7
2015: 14
2016: 27
2017: 84
2018: 196
2019: 395
2020: 488
2021: 771
2022: 823
2023: 954
2024: 1360
2025: 1713
2026: 464
Papers
AR-Pro: Counterfactual Explanations for Anomaly Repair with Formal Properties
NIPS 2024
Interpret Your Decision: Logical Reasoning Regularization for Generalization in Visual Classification
NIPS 2024
Finding Transformer Circuits With Edge Pruning
NIPS 2024
Auditing Local Explanations is Hard
NIPS 2024
RGMDT: Return-Gap-Minimizing Decision Tree Extraction in Non-Euclidean Metric Space
NIPS 2024
OlympicArena: Benchmarking Multi-discipline Cognitive Reasoning for Superintelligent AI
NIPS 2024
Interpretable Concept-Based Memory Reasoning
NIPS 2024
Metacognitive Capabilities of LLMs: An Exploration in Mathematical Problem Solving
NIPS 2024
RAGChecker: A Fine-grained Framework for Diagnosing Retrieval-Augmented Generation
NIPS 2024
Parallel Backpropagation for Shared-Feature Visualization
NIPS 2024
Structured Matrix Basis for Multivariate Time Series Forecasting with Interpretable Dynamics
NIPS 2024
Transcoders find interpretable LLM feature circuits
NIPS 2024
Faster Repeated Evasion Attacks in Tree Ensembles
NIPS 2024
Pre-trained Large Language Models Use Fourier Features to Compute Addition
NIPS 2024
DETAIL: Task DEmonsTration Attribution for Interpretable In-context Learning
NIPS 2024
Improving Decision Sparsity
NIPS 2024
Evidence of Learned Look-Ahead in a Chess-Playing Neural Network
NIPS 2024
Interpretable Mesomorphic Networks for Tabular Data
NIPS 2024
Where does In-context Learning Happen in Large Language Models?
NIPS 2024
Graph-based Uncertainty Metrics for Long-form Language Model Generations
NIPS 2024
ClashEval: Quantifying the tug-of-war between an LLM’s internal prior and external evidence
NIPS 2024
Disentangling the Roles of Distinct Cell Classes with Cell-Type Dynamical Systems
NIPS 2024
LLM-Check: Investigating Detection of Hallucinations in Large Language Models
NIPS 2024
Web-Scale Visual Entity Recognition: An LLM-Driven Data Approach
NIPS 2024
CoSy: Evaluating Textual Explanations of Neurons
NIPS 2024
<
1
…
88
89
90
…
293
>