Research Explorer
Interpretability
7318 directly classified papers
Papers per year
2003: 1
2006: 1
2007: 1
2008: 1
2009: 1
2010: 5
2012: 2
2013: 10
2014: 7
2015: 14
2016: 27
2017: 84
2018: 196
2019: 395
2020: 488
2021: 771
2022: 823
2023: 954
2024: 1360
2025: 1713
2026: 464
Papers
Cross-Refine: Improving Natural Language Explanation Generation by Learning in Tandem
COLING 2025
Cognitive Biases, Task Complexity, and Result Interpretability in Large Language Models
COLING 2025
Towards Faithful Multi-step Reasoning through Fine-Grained Causal-aware Attribution Reasoning Distillation
COLING 2025
Discrete Subgraph Sampling for Interpretable Graph based Visual Question Answering
COLING 2025
A Compliance Checking Framework Based on Retrieval Augmented Generation
COLING 2025
Towards Understanding Multi-Task Learning (Generalization) of LLMs via Detecting and Exploring Task-Specific Neurons
COLING 2025
Revisiting Jailbreaking for Large Language Models: A Representation Engineering Perspective
COLING 2025
NovAScore: A New Automated Metric for Evaluating Document Level Novelty
COLING 2025
Assessing the Human Likeness of AI-Generated Counterspeech
COLING 2025
DiL: An Explainable and Practical Metric for Abnormal Uncertainty in Object Detection
WACV 2025
Towards a Universal Synthetic Video Detector: From Face or Background Manipulations to Fully AI-Generated Content
CVPR 2025
AIM: Amending Inherent Interpretability via Self-Supervised Masking
ICCV 2025
Bridging the Gap Between Ideal and Real-world Evaluation: Benchmarking AI-Generated Image Detection in Challenging Scenarios
ICCV 2025
Leveraging Spatial Invariance to Boost Adversarial Transferability
ICCV 2025
Derailer-Rerailer: Adaptive Verification for Efficient and Reliable Language Model Reasoning
ACL 2025
TruthPrInt: Mitigating Large Vision-Language Models Object Hallucination Via Latent Truthful-Guided Pre-Intervention
ICCV 2025
On the Complexity-Faithfulness Trade-off of Gradient-Based Explanations
ICCV 2025
NCL-UoR at SemEval-2025 Task 3: Detecting Multilingual Hallucination and Related Observable Overgeneration Text Spans with Modified RefChecker and Modified SelfCheckGPT
SEMEVAL 2025
Hallucination Detectives at SemEval-2025 Task 3: Span-Level Hallucination Detection for LLM-Generated Answers
SEMEVAL 2025
Are You Trying to Convince Me or Are You Trying to Deceive Me? Using Argumentation Types to Identify Deceptive News
ACL 2025
LLMSR@XLLM25: SWRV: Empowering Self-Verification of Small Language Models through Step-wise Reasoning and Verification
ACL 2025
LLMSR@XLLM25: An Empirical Study of LLM for Structural Reasoning
ACL 2025
PALI-NLP at SemEval 2025 Task 1: Multimodal Idiom Recognition and Alignment
ACL 2025
CSECU-Learners at SemEval-2025 Task 9: Enhancing Transformer Model for Explainable Food Hazard Detection in Text
ACL 2025
LCTeam at SemEval-2025 Task 3: Multilingual Detection of Hallucinations and Overgeneration Mistakes Using XLM-RoBERTa
ACL 2025