Interpretability

7,318 papers

Papers

Make Your Decision Convincing! A Unified Two-Stage Framework: Self-Attribution and Decision-Making EMNLP 2023

Self-Supervised Rule Learning to Link Text Segments to Relational Elements of Structured Knowledge EMNLP 2023

Towards Mitigating LLM Hallucination via Self Reflection EMNLP 2023

ExplainCPE: A Free-text Explanation Benchmark of Chinese Pharmacist Examination EMNLP 2023

Reducing Spurious Correlations in Aspect-based Sentiment Analysis with Explanation from Large Language Models EMNLP 2023

On Uncertainty Calibration and Selective Generation in Probabilistic Neural Summarization: A Benchmark Study EMNLP 2023

Measuring Faithful and Plausible Visual Grounding in VQA EMNLP 2023

Extractive Summarization via ChatGPT for Faithful Summary Generation EMNLP 2023

PsyAttention: Psychological Attention Model for Personality Detection EMNLP 2023

Test-time Augmentation for Factual Probing EMNLP 2023

A New Benchmark and Reverse Validation Method for Passage-level Hallucination Detection EMNLP 2023

APP: Adaptive Prototypical Pseudo-Labeling for Few-shot OOD Detection EMNLP 2023

Proto-lm: A Prototypical Network-Based Framework for Built-in Interpretability in Large Language Models EMNLP 2023

Transparency at the Source: Evaluating and Interpreting Language Models With Access to the True Distribution EMNLP 2023

Statistically Profiling Biases in Natural Language Reasoning Datasets and Models EMNLP 2023

Verb Conjugation in Transformers Is Determined by Linear Encodings of Subject Number EMNLP 2023

That was the last straw, we need more: Are Translation Systems Sensitive to Disambiguating Context? EMNLP 2023

MindGames: Targeting Theory of Mind in Large Language Models with Dynamic Epistemic Modal Logic EMNLP 2023

ZARA: Improving Few-Shot Self-Rationalization for Small Language Models EMNLP 2023

Robustness Tests for Automatic Machine Translation Metrics with Adversarial Attacks EMNLP 2023

Is Probing All You Need? Indicator Tasks as an Alternative to Probing Embedding Spaces EMNLP 2023

DeltaScore: Fine-Grained Story Evaluation with Perturbations EMNLP 2023

InterroLang: Exploring NLP Models and Datasets through Dialogue-based Explanations EMNLP 2023

INVITE: a Testbed of Automatically Generated Invalid Questions to Evaluate Large Language Models for Hallucinations EMNLP 2023

HARE: Explainable Hate Speech Detection with Step-by-Step Reasoning EMNLP 2023