conftrace
Interpretability
7,318 papers
Papers per year
2003: 1
2006: 1
2007: 1
2008: 1
2009: 1
2010: 5
2012: 2
2013: 10
2014: 7
2015: 14
2016: 27
2017: 84
2018: 196
2019: 395
2020: 488
2021: 771
2022: 823
2023: 954
2024: 1360
2025: 1713
2026: 464
Papers
Language Models with Rationality
EMNLP 2023
BRAINTEASER: Lateral Thinking Puzzles for Large Language Models
EMNLP 2023
When are Lemons Purple? The Concept Association Bias of Vision-Language Models
EMNLP 2023
FANToM: A Benchmark for Stress-testing Machine Theory of Mind in Interactions
EMNLP 2023
Unveiling the Essence of Poetry: Introducing a Comprehensive Dataset and Benchmark for Poem Summarization
EMNLP 2023
Dr ChatGPT tell me what I want to hear: How different prompts impact health answer correctness
EMNLP 2023
Hop, Union, Generate: Explainable Multi-hop Reasoning without Rationale Supervision
EMNLP 2023
Interactive Text-to-SQL Generation via Editable Step-by-Step Explanations
EMNLP 2023
A Benchmark for Reasoning with Spatial Prepositions
EMNLP 2023
Towards Interpretable and Efficient Automatic Reference-Based Summarization Evaluation
EMNLP 2023
Interpreting and Exploiting Functional Specialization in Multi-Head Attention under Multi-task Learning
EMNLP 2023
Do Transformers Parse while Predicting the Masked Word?
EMNLP 2023
GenEx: A Commonsense-aware Unified Generative Framework for Explainable Cyberbullying Detection
EMNLP 2023
Muted: Multilingual Targeted Offensive Speech Identification and Visualization
EMNLP 2023
Thresh: A Unified, Customizable and Deployable Platform for Fine-Grained Text Evaluation
EMNLP 2023
DRGCoder: Explainable Clinical Coding for the Early Prediction of Diagnostic-Related Groups
EMNLP 2023
NewsSense: Reference-free Verification via Cross-document Comparison
EMNLP 2023
LM-Polygraph: Uncertainty Estimation for Language Models
EMNLP 2023
PROMINET: Prototype-based Multi-View Network for Interpretable Email Response Prediction
EMNLP 2023
Harnessing LLMs for Temporal Data - A Study on Explainable Financial Time Series Forecasting
EMNLP 2023
AutoReply: Detecting Nonsense in Dialogue with Discriminative Replies
EMNLP 2023
Plausibility Processing in Transformer Language Models: Focusing on the Role of Attention Heads in GPT
EMNLP 2023
DelucionQA: Detecting Hallucinations in Domain-specific Question Answering
EMNLP 2023
The Internal State of an LLM Knows When It’s Lying
EMNLP 2023
Towards A Holistic Landscape of Situated Theory of Mind in Large Language Models
EMNLP 2023