Interpretability

7318 directly classified papers

Papers

Cross-Refine: Improving Natural Language Explanation Generation by Learning in Tandem COLING 2025

Cognitive Biases, Task Complexity, and Result Interpretability in Large Language Models COLING 2025

Towards Faithful Multi-step Reasoning through Fine-Grained Causal-aware Attribution Reasoning Distillation COLING 2025

Discrete Subgraph Sampling for Interpretable Graph based Visual Question Answering COLING 2025

A Compliance Checking Framework Based on Retrieval Augmented Generation COLING 2025

Towards Understanding Multi-Task Learning (Generalization) of LLMs via Detecting and Exploring Task-Specific Neurons COLING 2025

Revisiting Jailbreaking for Large Language Models: A Representation Engineering Perspective COLING 2025

NovAScore: A New Automated Metric for Evaluating Document Level Novelty COLING 2025

Assessing the Human Likeness of AI-Generated Counterspeech COLING 2025

DiL: An Explainable and Practical Metric for Abnormal Uncertainty in Object Detection WACV 2025

Towards a Universal Synthetic Video Detector: From Face or Background Manipulations to Fully AI-Generated Content CVPR 2025

AIM: Amending Inherent Interpretability via Self-Supervised Masking ICCV 2025

Bridging the Gap Between Ideal and Real-world Evaluation: Benchmarking AI-Generated Image Detection in Challenging Scenarios ICCV 2025

Leveraging Spatial Invariance to Boost Adversarial Transferability ICCV 2025

Derailer-Rerailer: Adaptive Verification for Efficient and Reliable Language Model Reasoning ACL 2025

TruthPrInt: Mitigating Large Vision-Language Models Object Hallucination Via Latent Truthful-Guided Pre-Intervention ICCV 2025

On the Complexity-Faithfulness Trade-off of Gradient-Based Explanations ICCV 2025

NCL-UoR at SemEval-2025 Task 3: Detecting Multilingual Hallucination and Related Observable Overgeneration Text Spans with Modified RefChecker and Modified SelfCheckGPT SEMEVAL 2025

Hallucination Detectives at SemEval-2025 Task 3: Span-Level Hallucination Detection for LLM-Generated Answers SEMEVAL 2025

Are You Trying to Convince Me or Are You Trying to Deceive Me? Using Argumentation Types to Identify Deceptive News ACL 2025

LLMSR@XLLM25: SWRV: Empowering Self-Verification of Small Language Models through Step-wise Reasoning and Verification ACL 2025

LLMSR@XLLM25: An Empirical Study of LLM for Structural Reasoning ACL 2025

PALI-NLP at SemEval 2025 Task 1: Multimodal Idiom Recognition and Alignment ACL 2025

CSECU-Learners at SemEval-2025 Task 9: Enhancing Transformer Model for Explainable Food Hazard Detection in Text ACL 2025

LCTeam at SemEval-2025 Task 3: Multilingual Detection of Hallucinations and Overgeneration Mistakes Using XLM-RoBERTa ACL 2025