Manish Nagireddy
9 papers · 2024–2026 · 6 top CS/AI conferences
Achievements
Taxonomy Completionist (14)
Cross-Pollinator (15)
Interdisciplinary Bridge
Conference Polyglot (6)
Keyword Pioneer
Hot Topic Early Bird
Mega-Team (22)
Conferences
ACL (2)
EMNLP (2)
NAACL (2)
AAAI (1)
ICLR (1)
IJCAI (1)
Keywords
large language model (5)
value alignment (2)
dialogue system (2)
ai safety (1)
synthetic data generation (1)
language model (1)
generative ai (1)
hallucination detection (1)
harmful content detection (1)
multimodal model (1)
reasoning trace (1)
adversarial testing (1)
human-ai interaction (1)
faithful explanation (1)
attribution method (1)
explanation method (1)
social bias detection (1)
generative language model (1)
moral reasoning (1)
question answering benchmark (1)
Papers
Answering the Wrong Question: Reasoning Trace Inversion for Abstention in LLMs
ACL 2026
Multi-Level Explanations for Generative Language Models
ACL 2025
Programming Refusal with Conditional Activation Steering
ICLR 2025
Granite Guardian: Comprehensive LLM Safeguarding
NAACL 2025
DAMAGeR: Deploying Automatic and Manual Approaches to GenAI Red-teaming
NAACL 2025
ComVas: Contextual Moral Values Alignment System
IJCAI 2024
Value Alignment from Unstructured Text
EMNLP 2024
Language Models in Dialogue: Conversational Maxims for Human-AI Interactions
EMNLP 2024
SocialStigmaQA: A Benchmark to Uncover Stigma Amplification in Generative Language Models
AAAI 2024