conftrace
Interpretability
7,318 papers
Papers per year
2003: 1
2006: 1
2007: 1
2008: 1
2009: 1
2010: 5
2012: 2
2013: 10
2014: 7
2015: 14
2016: 27
2017: 84
2018: 196
2019: 395
2020: 488
2021: 771
2022: 823
2023: 954
2024: 1360
2025: 1713
2026: 464
Papers
Make Your Decision Convincing! A Unified Two-Stage Framework: Self-Attribution and Decision-Making
EMNLP 2023
Self-Supervised Rule Learning to Link Text Segments to Relational Elements of Structured Knowledge
EMNLP 2023
Towards Mitigating LLM Hallucination via Self Reflection
EMNLP 2023
ExplainCPE: A Free-text Explanation Benchmark of Chinese Pharmacist Examination
EMNLP 2023
Reducing Spurious Correlations in Aspect-based Sentiment Analysis with Explanation from Large Language Models
EMNLP 2023
On Uncertainty Calibration and Selective Generation in Probabilistic Neural Summarization: A Benchmark Study
EMNLP 2023
Measuring Faithful and Plausible Visual Grounding in VQA
EMNLP 2023
Extractive Summarization via ChatGPT for Faithful Summary Generation
EMNLP 2023
PsyAttention: Psychological Attention Model for Personality Detection
EMNLP 2023
Test-time Augmentation for Factual Probing
EMNLP 2023
A New Benchmark and Reverse Validation Method for Passage-level Hallucination Detection
EMNLP 2023
APP: Adaptive Prototypical Pseudo-Labeling for Few-shot OOD Detection
EMNLP 2023
Proto-lm: A Prototypical Network-Based Framework for Built-in Interpretability in Large Language Models
EMNLP 2023
Transparency at the Source: Evaluating and Interpreting Language Models With Access to the True Distribution
EMNLP 2023
Statistically Profiling Biases in Natural Language Reasoning Datasets and Models
EMNLP 2023
Verb Conjugation in Transformers Is Determined by Linear Encodings of Subject Number
EMNLP 2023
That was the last straw, we need more: Are Translation Systems Sensitive to Disambiguating Context?
EMNLP 2023
MindGames: Targeting Theory of Mind in Large Language Models with Dynamic Epistemic Modal Logic
EMNLP 2023
ZARA: Improving Few-Shot Self-Rationalization for Small Language Models
EMNLP 2023
Robustness Tests for Automatic Machine Translation Metrics with Adversarial Attacks
EMNLP 2023
Is Probing All You Need? Indicator Tasks as an Alternative to Probing Embedding Spaces
EMNLP 2023
DeltaScore: Fine-Grained Story Evaluation with Perturbations
EMNLP 2023
InterroLang: Exploring NLP Models and Datasets through Dialogue-based Explanations
EMNLP 2023
INVITE: a Testbed of Automatically Generated Invalid Questions to Evaluate Large Language Models for Hallucinations
EMNLP 2023
HARE: Explainable Hate Speech Detection with Step-by-Step Reasoning
EMNLP 2023