Interpretability

7,318 papers

Papers

Propagation and Pitfalls: Reasoning-based Assessment of Knowledge Editing through Counterfactual Tasks ACL 2024

Paying Attention to Deflections: Mining Pragmatic Nuances for Whataboutism Detection in Online Discourse ACL 2024

Epistemology of Language Models: Do Language Models Have Holistic Knowledge? ACL 2024

Competition-Level Problems are Effective LLM Evaluators ACL 2024

Argument-Based Sentiment Analysis on Forward-Looking Statements ACL 2024

LLMs cannot find reasoning errors, but can correct them given the error location ACL 2024

TextGenSHAP: Scalable Post-Hoc Explanations in Text Generation with Long Documents ACL 2024

Learning Fine-Grained Grounded Citations for Attributed Large Language Models ACL 2024

Unsupervised Real-Time Hallucination Detection based on the Internal States of Large Language Models ACL 2024

Preemptive Answer “Attacks” on Chain-of-Thought Reasoning ACL 2024

Automatic Bug Detection in LLM-Powered Text-Based Games Using LLMs ACL 2024

What Makes Language Models Good-enough? ACL 2024

TELLER: A Trustworthy Framework for Explainable, Generalizable and Controllable Fake News Detection ACL 2024

Mitigating Hallucinations in Large Vision-Language Models with Instruction Contrastive Decoding ACL 2024

A Critical Study of What Code-LLMs (Do Not) Learn ACL 2024

On the Robustness of Document-Level Relation Extraction Models to Entity Name Variations ACL 2024

How and where does CLIP process negation? ACL 2024

Improving Language Models Trained on Translated Data with Continual Pre-Training and Dictionary Learning Analysis ACL 2024

CUET_sstm at ArAIEval Shared Task: Unimodal (Text) Propagandistic Technique Detection Using Transformer-Based Model ACL 2024

Uot1 at FIGNEWS 2024 Shared Task: Labeling News Bias ACL 2024

The Guidelines Specialists at FIGNEWS 2024 Shared Task: An annotation guideline to Unravel Bias in News Media Narratives Using a Linguistic Approach ACL 2024

Evaluating the Robustness of Adverse Drug Event Classification Models using Templates ACL 2024

Automatic Extraction of Disease Risk Factors from Medical Publications ACL 2024

XAI for Better Exploitation of Text in Medical Decision Support ACL 2024

Large language models fail to derive atypicality inferences in a human-like manner ACL 2024