conftrace_

Agam Goyal

6 papers · 2024–2025 · 2 conferences · across top CS/AI conferences

Achievements

Jump to papers ↓

+1 more ↓

🌍 Conference Polyglot (2) 🌉 Interdisciplinary Bridge 🗺️ Taxonomy Completionist (20) 🧭 Keyword Pioneer 🐣 Hot Topic Early Bird

🐝 Cross-Pollinator (15)

Conferences

EMNLP (4) NAACL (2)

Top co-authors

Eshwar Chandrasekharan (3) Yun-Shiuan Chuang (2) Junjie Hu (2) Sijia Yang (2) Koustuv Saha (2) Yilun Chen (2) Xianyang Zhan (2) Robert Hawkins (1) Dhavan V. Shah (1) Yian Wang (1)

Keywords

large language model (4) text classification (2) content moderation (2) argument extraction (1) toxicity detection (1) text summarization (1) explainable ai (1) ai safety (1) network alignment (1) harmful content (1) benchmark dataset (1) language model (1) mixture of expert (1) sparse autoencoder (1) role-playing agent (1) harmful content detection (1) causal intervention (1) residual stream (1) jailbreak defense (1) multi-agent simulation (1)

Papers

MoMoE: Mixture of Moderation Experts Framework for AI-Assisted Online Governance EMNLP 2025 Breaking Bad Tokens: Detoxification of LLMs Using Sparse Autoencoders EMNLP 2025 ArgCMV: An Argument Summarization Benchmark for the LLM-era EMNLP 2025 SLM-Mod: Small Language Models Surpass LLMs at Content Moderation NAACL 2025 Beyond Demographics: Aligning Role-playing LLM-based Agents Using Human Belief Networks EMNLP 2024 Simulating Opinion Dynamics with Networks of LLM-based Agents NAACL 2024