conftrace
_
Papers
Trends
Conferences
Explore
Authors
Papers
Trends
Conferences
Explore
Authors
Topics
Keywords
Achievements
← Core AI
Artificial Intelligence
›
Core AI
›
Interpretability
7,318 papers
Papers per year
2003: 1
2006: 1
2007: 1
2008: 1
2009: 1
2010: 5
2012: 2
2013: 10
2014: 7
2015: 14
2016: 27
2017: 84
2018: 196
2019: 395
2020: 488
2021: 771
2022: 823
2023: 954
2024: 1360
2025: 1713
2026: 464
Papers
CHARP: Conversation History AwaReness Probing for Knowledge-grounded Dialogue Systems
ACL 2024
k-SemStamp: A Clustering-Based Semantic Watermark for Detection of Machine-Generated Text
ACL 2024
A Graph per Persona: Reasoning about Subjective Natural Language Descriptions
ACL 2024
Direct Evaluation of Chain-of-Thought in Multi-hop Reasoning with Knowledge Graphs
ACL 2024
Concept-Best-Matching: Evaluating Compositionality In Emergent Communication
ACL 2024
Enhancing Semantic Consistency of Large Language Models through Model Editing: An Interpretability-Oriented Approach
ACL 2024
Data-Centric Explainable Debiasing for Improving Fairness in Pre-trained Language Models
ACL 2024
Just Ask One More Time! Self-Agreement Improves Reasoning of Language Models in (Almost) All Scenarios
ACL 2024
Discerning and Resolving Knowledge Conflicts through Adaptive Decoding with Contextual Information-Entropy Constraint
ACL 2024
A Mechanistic Analysis of a Transformer Trained on a Symbolic Multi-Step Reasoning Task
ACL 2024
SyntaxShap: Syntax-aware Explainability Method for Text Generation
ACL 2024
Automated Detection and Analysis of Data Practices Using A Real-World Corpus
ACL 2024
Decomposing Co-occurrence Matrices into Interpretable Components as Formal Concepts
ACL 2024
Locating and Extracting Relational Concepts in Large Language Models
ACL 2024
Plausible Extractive Rationalization through Semi-Supervised Entailment Signal
ACL 2024
Which Information Matters? Dissecting Human-written Multi-document Summaries with Partial Information Decomposition
ACL 2024
Unveiling Selection Biases: Exploring Order and Token Sensitivity in Large Language Models
ACL 2024
LJPCheck: Functional Tests for Legal Judgment Prediction
ACL 2024
When to Trust LLMs: Aligning Confidence with Response Quality
ACL 2024
Mitigating Hallucinations in Large Vision-Language Models (LVLMs) via Language-Contrastive Decoding (LCD)
ACL 2024
Exploring Spatial Schema Intuitions in Large Language and Vision Models
ACL 2024
Too Big to Fail: Larger Language Models are Disproportionately Resilient to Induction of Dementia-Related Linguistic Anomalies
ACL 2024
Knowledge of Knowledge: Exploring Known-Unknowns Uncertainty with Large Language Models
ACL 2024
Identifying Semantic Induction Heads to Understand In-Context Learning
ACL 2024
Spotting AI’s Touch: Identifying LLM-Paraphrased Spans in Text
ACL 2024
<
1
…
106
107
108
…
293
>