conftrace_

Ranjan Satapathy

8 papers · 2024–2026 · 4 conferences · across top CS/AI conferences

Achievements

Jump to papers ↓

+2 more ↓

🌉 Interdisciplinary Bridge 🌍 Conference Polyglot (3) 🌈 Renaissance Researcher (5) 🐝 Cross-Pollinator (15) 🗺️ Taxonomy Completionist (14)

🧭 Keyword Pioneer ❓ The Questioner

Conferences

EMNLP (4) NAACL (2) AAAI (1) ACL (1)

Top co-authors

Erik Cambria (7) Yeo Wei Jie (5) Roy Ka-Wei Lee (2) Wei Jie Yeo (2) Rick Goh (2) Nirmalendu Prakash (2) Clement Neo (1) Huang Hejia (1) Xiaoneng Xiang (1) Przemyslaw Kazienko (1)

Keywords

large language model (4) causal mediation (2) sparse autoencoder (2) refusal behavior (2) chain-of-thought reasoning (1) text generation (1) interpretable machine learning (1) model interpretability (1) mechanistic interpretability (1) jailbreak attack (1) hallucination reduction (1) model interpretation (1) natural language explanation (1) activation patching (1) feature intervention (1) sustainability reporting (1) faithfulness measurement (1) causal faithfulness (1) environmental social governance (1) extractive rationalization (1)

Papers

Beyond I’m Sorry, I Can’t: Dissecting Large-Language-Model Refusal AAAI 2026 Towards Faithful Natural Language Explanations: A Study Using Activation Patching in Large Language Models EMNLP 2025 Understanding Refusal in Language Models with Sparse Autoencoders EMNLP 2025 From Earnings Calls to Investment Reports: Evaluating Role-based Multi-Agent LLM Systems EMNLP 2025 SusGen-GPT: A Data-Centric LLM for Financial NLP and Sustainability Report Generation NAACL 2025 How Interpretable are Reasoning Explanations from Prompting Large Language Models? NAACL 2024 Self-training Large Language Models through Knowledge Detection EMNLP 2024 Plausible Extractive Rationalization through Semi-Supervised Entailment Signal ACL 2024