conftrace
_
Papers
Trends
Conferences
Explore
Authors
Papers
Trends
Conferences
Explore
Authors
Topics
Keywords
Achievements
← Core AI
Artificial Intelligence
›
Core AI
›
Interpretability
7,318 papers
Papers per year
2003: 1
2006: 1
2007: 1
2008: 1
2009: 1
2010: 5
2012: 2
2013: 10
2014: 7
2015: 14
2016: 27
2017: 84
2018: 196
2019: 395
2020: 488
2021: 771
2022: 823
2023: 954
2024: 1360
2025: 1713
2026: 464
Papers
ALiiCE: Evaluating Positional Fine-grained Citation Generation
NAACL 2025
On Behalf of the Stakeholders: Trends in NLP Model Interpretability in the Era of LLMs
NAACL 2025
Does Liking Yellow Imply Driving a School Bus? Semantic Leakage in Language Models
NAACL 2025
Reversed Attention: On The Gradient Descent Of Attention Layers In GPT
NAACL 2025
RAP: A Metric for Balancing Repetition and Performance in Open-Source Large Language Models
NAACL 2025
What Did I Do Wrong? Quantifying LLMs’ Sensitivity and Consistency to Prompt Engineering
NAACL 2025
Option Symbol Matters: Investigating and Mitigating Multiple-Choice Option Symbol Bias of Large Language Models
NAACL 2025
Information-Guided Identification of Training Data Imprint in (Proprietary) Large Language Models
NAACL 2025
An Interpretable and Crosslingual Method for Evaluating Second-Language Dialogues
NAACL 2025
Discourse-Driven Evaluation: Unveiling Factual Inconsistency in Long Document Summarization
NAACL 2025
Token-Level Density-Based Uncertainty Quantification Methods for Eliciting Truthfulness of Large Language Models
NAACL 2025
From Redundancy to Relevance: Information Flow in LVLMs Across Reasoning Tasks
NAACL 2025
SafeQuant: LLM Safety Analysis via Quantized Gradient Inspection
NAACL 2025
NLI under the Microscope: What Atomic Hypothesis Decomposition Reveals
NAACL 2025
Benchmarking Language Model Creativity: A Case Study on Code Generation
NAACL 2025
Probe-Free Low-Rank Activation Intervention
NAACL 2025
Racing Thoughts: Explaining Contextualization Errors in Large Language Models
NAACL 2025
A Probabilistic Framework for LLM Hallucination Detection via Belief Tree Propagation
NAACL 2025
Incremental Sentence Processing Mechanisms in Autoregressive Transformer Language Models
NAACL 2025
Extracting and Understanding the Superficial Knowledge in Alignment
NAACL 2025
LLM The Genius Paradox: A Linguistic and Math Expert’s Struggle with Simple Word-based Counting Problems
NAACL 2025
Who Relies More on World Knowledge and Bias for Syntactic Ambiguity Resolution: Humans or LLMs?
NAACL 2025
Benchmarking Large Language Models on Answering and Explaining Challenging Medical Questions
NAACL 2025
What We Talk About When We Talk About LMs: Implicit Paradigm Shifts and the Ship of Language Models
NAACL 2025
MMEvalPro: Calibrating Multimodal Benchmarks Towards Trustworthy and Efficient Evaluation
NAACL 2025
<
1
…
79
80
81
…
293
>