Ranjan Satapathy
8 papers · 2024–2026 · 4 conferences · across top CS/AI conferences
Achievements
Jump to papers ↓+2 more ↓ Show less ↑
π Interdisciplinary Bridge π Conference Polyglot (3) π Renaissance Researcher (5) π Cross-Pollinator (15) πΊοΈ Taxonomy Completionist (14)
π§
Keyword Pioneer
β
The Questioner
Conferences
EMNLP (4)
NAACL (2)
AAAI (1)
ACL (1)
Top co-authors
Keywords
large language model
(4)
causal mediation
(2)
sparse autoencoder
(2)
refusal behavior
(2)
chain-of-thought reasoning
(1)
text generation
(1)
interpretable machine learning
(1)
model interpretability
(1)
mechanistic interpretability
(1)
jailbreak attack
(1)
hallucination reduction
(1)
model interpretation
(1)
natural language explanation
(1)
activation patching
(1)
feature intervention
(1)
sustainability reporting
(1)
faithfulness measurement
(1)
causal faithfulness
(1)
environmental social governance
(1)
extractive rationalization
(1)
Papers
Beyond Iβm Sorry, I Canβt: Dissecting Large-Language-Model Refusal
AAAI 2026
Towards Faithful Natural Language Explanations: A Study Using Activation Patching in Large Language Models
EMNLP 2025
Understanding Refusal in Language Models with Sparse Autoencoders
EMNLP 2025
From Earnings Calls to Investment Reports: Evaluating Role-based Multi-Agent LLM Systems
EMNLP 2025
SusGen-GPT: A Data-Centric LLM for Financial NLP and Sustainability Report Generation
NAACL 2025
How Interpretable are Reasoning Explanations from Prompting Large Language Models?
NAACL 2024
Self-training Large Language Models through Knowledge Detection
EMNLP 2024
Plausible Extractive Rationalization through Semi-Supervised Entailment Signal
ACL 2024