conftrace
_
Papers
Trends
Conferences
Explore
Authors
Papers
Trends
Conferences
Explore
Authors
Topics
Keywords
Achievements
← Core AI
Artificial Intelligence
›
Core AI
›
Interpretability
7,318 papers
Papers per year
2003: 1
2006: 1
2007: 1
2008: 1
2009: 1
2010: 5
2012: 2
2013: 10
2014: 7
2015: 14
2016: 27
2017: 84
2018: 196
2019: 395
2020: 488
2021: 771
2022: 823
2023: 954
2024: 1360
2025: 1713
2026: 464
Papers
Propagation and Pitfalls: Reasoning-based Assessment of Knowledge Editing through Counterfactual Tasks
ACL 2024
Paying Attention to Deflections: Mining Pragmatic Nuances for Whataboutism Detection in Online Discourse
ACL 2024
Epistemology of Language Models: Do Language Models Have Holistic Knowledge?
ACL 2024
Competition-Level Problems are Effective LLM Evaluators
ACL 2024
Argument-Based Sentiment Analysis on Forward-Looking Statements
ACL 2024
LLMs cannot find reasoning errors, but can correct them given the error location
ACL 2024
TextGenSHAP: Scalable Post-Hoc Explanations in Text Generation with Long Documents
ACL 2024
Learning Fine-Grained Grounded Citations for Attributed Large Language Models
ACL 2024
Unsupervised Real-Time Hallucination Detection based on the Internal States of Large Language Models
ACL 2024
Preemptive Answer “Attacks” on Chain-of-Thought Reasoning
ACL 2024
Automatic Bug Detection in LLM-Powered Text-Based Games Using LLMs
ACL 2024
What Makes Language Models Good-enough?
ACL 2024
TELLER: A Trustworthy Framework for Explainable, Generalizable and Controllable Fake News Detection
ACL 2024
Mitigating Hallucinations in Large Vision-Language Models with Instruction Contrastive Decoding
ACL 2024
A Critical Study of What Code-LLMs (Do Not) Learn
ACL 2024
On the Robustness of Document-Level Relation Extraction Models to Entity Name Variations
ACL 2024
How and where does CLIP process negation?
ACL 2024
Improving Language Models Trained on Translated Data with Continual Pre-Training and Dictionary Learning Analysis
ACL 2024
CUET_sstm at ArAIEval Shared Task: Unimodal (Text) Propagandistic Technique Detection Using Transformer-Based Model
ACL 2024
Uot1 at FIGNEWS 2024 Shared Task: Labeling News Bias
ACL 2024
The Guidelines Specialists at FIGNEWS 2024 Shared Task: An annotation guideline to Unravel Bias in News Media Narratives Using a Linguistic Approach
ACL 2024
Evaluating the Robustness of Adverse Drug Event Classification Models using Templates
ACL 2024
Automatic Extraction of Disease Risk Factors from Medical Publications
ACL 2024
XAI for Better Exploitation of Text in Medical Decision Support
ACL 2024
Large language models fail to derive atypicality inferences in a human-like manner
ACL 2024
<
1
…
108
109
110
…
293
>