conftrace_

Mengru Wang

12 papers · 2022–2026 · 5 conferences · across top CS/AI conferences

Achievements

Jump to papers ↓

+3 more ↓

🌍 Conference Polyglot (5) 🐝 Cross-Pollinator (12) 🗺️ Taxonomy Completionist (25) 🧭 Keyword Pioneer 🌉 Interdisciplinary Bridge

💎 Century Club (11) ⚡ Prolific Year (6) ❓ The Questioner

Conferences

ACL (5) EMNLP (4) COLING (1) ICLR (1) NIPS (1)

Top co-authors

Huajun Chen (10) Ningyu Zhang (10) Ziwen Xu (8) Shumin Deng (7) Yunzhi Yao (7) Zekun Xi (3) Guozhou Zheng (2) Bozhong Tian (2) Siyuan Cheng (2) Bryan Hooi (2)

Keywords

large language model (6) knowledge editing (5) model editing (3) model safety (2) multimodal large language model (1) machine unlearning (1) few-shot learning (1) language model (1) relation extraction (1) model fine-tuning (1) knowledge forgetting (1) parameter-efficient tuning (1) sparse autoencoder (1) control signal (1) safety benchmark (1) steering vector (1) knowledge unlearning (1) adversarial input (1) computation graph (1) adversarial learning (1)

Papers

Why Steering Works: Toward a Unified View of Language Model Parameter Dynamics ACL 2026 ReLearn: Unlearning via Learning for Large Language Models ACL 2025 Beyond Prompt Engineering: Robust Behavior Control in LLMs via Steering Target Atoms ACL 2025 Automating Steering for Safe Multimodal Large Language Models EMNLP 2025 EasyEdit2: An Easy-to-use Steering Framework for Editing Large Language Models EMNLP 2025 Unveiling the Pitfalls of Knowledge Editing for Large Language Models ICLR 2024 To Forget or Not? Towards Practical Knowledge Unlearning for Large Language Models EMNLP 2024 Detoxifying Large Language Models via Knowledge Editing ACL 2024 EasyEdit: An Easy-to-use Knowledge Editing Framework for Large Language Models ACL 2024 Knowledge Circuits in Pretrained Transformers NIPS 2024 Knowledge Mechanisms in Large Language Models: A Survey and Perspective EMNLP 2024 DRK: Discriminative Rule-based Knowledge for Relieving Prediction Confusions in Few-shot Relation Extraction COLING 2022