Arka Dutta
5 papers · 2024–2026 · 3 conferences · across top CS/AI conferences
Achievements
Jump to papers ↓
🌍
Conference Polyglot
(2)
🧭
Keyword Pioneer
🐝
Cross-Pollinator
(15)
❓
The Questioner
Conferences
AAAI (2)
IJCAI (2)
ACL (1)
Top co-authors
Keywords
large language model
(2)
responsible ai
(2)
ai safety
(1)
implicit bia
(1)
hallucination detection
(1)
social media
(1)
stress testing
(1)
participatory design
(1)
bias audit
(1)
bias auditing
(1)
vicarious interaction
(1)
participatory ai design
(1)
de-escalation content
(1)
adversarial nudge
(1)
social media analysis
(1)
factual fidelity
(1)
hope speech detection
(1)
Papers
How Can You Tell if Your Large Language Model Could Be a Closet Antisemite? An Explainability-Based Audit Framework for Implicit Bias
AAAI 2026
What About the Scene With the Hitler Reference? HAUNT: A Framework to Probe LLMs’ Self-consistency in Closed Domains Via Adversarial Nudge
ACL 2026
All You Need Is S P A C E: When Jailbreaking Meets Bias Audit and Reveals What Lies Beneath the Guardrails (Student Abstract)
AAAI 2025
Towards a Bipartisan Understanding of Peace and Vicarious Interactions
IJCAI 2025
Down the Toxicity Rabbit Hole: A Framework to Bias Audit Large Language Models with Key Emphasis on Racism, Antisemitism, and Misogyny
IJCAI 2024