Interpretability

7,318 papers

Papers

AR-Pro: Counterfactual Explanations for Anomaly Repair with Formal Properties NIPS 2024

Interpret Your Decision: Logical Reasoning Regularization for Generalization in Visual Classification NIPS 2024

Finding Transformer Circuits With Edge Pruning NIPS 2024

Auditing Local Explanations is Hard NIPS 2024

RGMDT: Return-Gap-Minimizing Decision Tree Extraction in Non-Euclidean Metric Space NIPS 2024

OlympicArena: Benchmarking Multi-discipline Cognitive Reasoning for Superintelligent AI NIPS 2024

Interpretable Concept-Based Memory Reasoning NIPS 2024

Metacognitive Capabilities of LLMs: An Exploration in Mathematical Problem Solving NIPS 2024

RAGChecker: A Fine-grained Framework for Diagnosing Retrieval-Augmented Generation NIPS 2024

Parallel Backpropagation for Shared-Feature Visualization NIPS 2024

Structured Matrix Basis for Multivariate Time Series Forecasting with Interpretable Dynamics NIPS 2024

Transcoders find interpretable LLM feature circuits NIPS 2024

Faster Repeated Evasion Attacks in Tree Ensembles NIPS 2024

Pre-trained Large Language Models Use Fourier Features to Compute Addition NIPS 2024

DETAIL: Task DEmonsTration Attribution for Interpretable In-context Learning NIPS 2024

Improving Decision Sparsity NIPS 2024

Evidence of Learned Look-Ahead in a Chess-Playing Neural Network NIPS 2024

Interpretable Mesomorphic Networks for Tabular Data NIPS 2024

Where does In-context Learning Happen in Large Language Models? NIPS 2024

Graph-based Uncertainty Metrics for Long-form Language Model Generations NIPS 2024

ClashEval: Quantifying the tug-of-war between an LLM’s internal prior and external evidence NIPS 2024

Disentangling the Roles of Distinct Cell Classes with Cell-Type Dynamical Systems NIPS 2024

LLM-Check: Investigating Detection of Hallucinations in Large Language Models NIPS 2024

Web-Scale Visual Entity Recognition: An LLM-Driven Data Approach NIPS 2024

CoSy: Evaluating Textual Explanations of Neurons NIPS 2024