Agam Goyal
6 papers · 2024–2025 · 2 conferences · across top CS/AI conferences
Achievements
Jump to papers ↓+1 more ↓ Show less ↑
π Conference Polyglot (2) π Interdisciplinary Bridge πΊοΈ Taxonomy Completionist (20) π§ Keyword Pioneer π£ Hot Topic Early Bird
π
Cross-Pollinator
(15)
Conferences
EMNLP (4)
NAACL (2)
Top co-authors
Keywords
large language model
(4)
text classification
(2)
content moderation
(2)
argument extraction
(1)
toxicity detection
(1)
text summarization
(1)
explainable ai
(1)
ai safety
(1)
network alignment
(1)
harmful content
(1)
benchmark dataset
(1)
language model
(1)
mixture of expert
(1)
sparse autoencoder
(1)
role-playing agent
(1)
harmful content detection
(1)
causal intervention
(1)
residual stream
(1)
jailbreak defense
(1)
multi-agent simulation
(1)
Papers
MoMoE: Mixture of Moderation Experts Framework for AI-Assisted Online Governance
EMNLP 2025
Breaking Bad Tokens: Detoxification of LLMs Using Sparse Autoencoders
EMNLP 2025
ArgCMV: An Argument Summarization Benchmark for the LLM-era
EMNLP 2025
SLM-Mod: Small Language Models Surpass LLMs at Content Moderation
NAACL 2025
Beyond Demographics: Aligning Role-playing LLM-based Agents Using Human Belief Networks
EMNLP 2024
Simulating Opinion Dynamics with Networks of LLM-based Agents
NAACL 2024