conftrace_

Artificial Intelligence › Core AI ›

Interpretability

7,318 papers

Papers per year

Papers

CHARP: Conversation History AwaReness Probing for Knowledge-grounded Dialogue Systems ACL 2024

k-SemStamp: A Clustering-Based Semantic Watermark for Detection of Machine-Generated Text ACL 2024

A Graph per Persona: Reasoning about Subjective Natural Language Descriptions ACL 2024

Direct Evaluation of Chain-of-Thought in Multi-hop Reasoning with Knowledge Graphs ACL 2024

Concept-Best-Matching: Evaluating Compositionality In Emergent Communication ACL 2024

Enhancing Semantic Consistency of Large Language Models through Model Editing: An Interpretability-Oriented Approach ACL 2024

Data-Centric Explainable Debiasing for Improving Fairness in Pre-trained Language Models ACL 2024

Just Ask One More Time! Self-Agreement Improves Reasoning of Language Models in (Almost) All Scenarios ACL 2024

Discerning and Resolving Knowledge Conflicts through Adaptive Decoding with Contextual Information-Entropy Constraint ACL 2024

A Mechanistic Analysis of a Transformer Trained on a Symbolic Multi-Step Reasoning Task ACL 2024

SyntaxShap: Syntax-aware Explainability Method for Text Generation ACL 2024

Automated Detection and Analysis of Data Practices Using A Real-World Corpus ACL 2024

Decomposing Co-occurrence Matrices into Interpretable Components as Formal Concepts ACL 2024

Locating and Extracting Relational Concepts in Large Language Models ACL 2024

Plausible Extractive Rationalization through Semi-Supervised Entailment Signal ACL 2024

Which Information Matters? Dissecting Human-written Multi-document Summaries with Partial Information Decomposition ACL 2024

Unveiling Selection Biases: Exploring Order and Token Sensitivity in Large Language Models ACL 2024

LJPCheck: Functional Tests for Legal Judgment Prediction ACL 2024

When to Trust LLMs: Aligning Confidence with Response Quality ACL 2024

Mitigating Hallucinations in Large Vision-Language Models (LVLMs) via Language-Contrastive Decoding (LCD) ACL 2024

Exploring Spatial Schema Intuitions in Large Language and Vision Models ACL 2024

Too Big to Fail: Larger Language Models are Disproportionately Resilient to Induction of Dementia-Related Linguistic Anomalies ACL 2024

Knowledge of Knowledge: Exploring Known-Unknowns Uncertainty with Large Language Models ACL 2024

Identifying Semantic Induction Heads to Understand In-Context Learning ACL 2024

Spotting AI’s Touch: Identifying LLM-Paraphrased Spans in Text ACL 2024