Mengru Wang
12 papers · 2022–2026 · 5 top CS/AI conferences
Achievements
Conference Polyglot (5)
Cross-Pollinator (12)
Taxonomy Completionist (25)
Keyword Pioneer
Interdisciplinary Bridge
Century Club (11)
Prolific Year (6)
The Questioner
Conferences
ACL (5)
EMNLP (4)
COLING (1)
ICLR (1)
NIPS (1)
Keywords
large language model (6)
knowledge editing (5)
model editing (3)
model safety (2)
multimodal large language model (1)
machine unlearning (1)
few-shot learning (1)
language model (1)
relation extraction (1)
model fine-tuning (1)
knowledge forgetting (1)
parameter-efficient tuning (1)
sparse autoencoder (1)
control signal (1)
safety benchmark (1)
steering vector (1)
knowledge unlearning (1)
adversarial input (1)
computation graph (1)
adversarial learning (1)
Papers
Why Steering Works: Toward a Unified View of Language Model Parameter Dynamics
ACL 2026
ReLearn: Unlearning via Learning for Large Language Models
ACL 2025
Beyond Prompt Engineering: Robust Behavior Control in LLMs via Steering Target Atoms
ACL 2025
Automating Steering for Safe Multimodal Large Language Models
EMNLP 2025
EasyEdit2: An Easy-to-use Steering Framework for Editing Large Language Models
EMNLP 2025
Unveiling the Pitfalls of Knowledge Editing for Large Language Models
ICLR 2024
To Forget or Not? Towards Practical Knowledge Unlearning for Large Language Models
EMNLP 2024
Detoxifying Large Language Models via Knowledge Editing
ACL 2024
EasyEdit: An Easy-to-use Knowledge Editing Framework for Large Language Models
ACL 2024
Knowledge Circuits in Pretrained Transformers
NIPS 2024
Knowledge Mechanisms in Large Language Models: A Survey and Perspective
EMNLP 2024
DRK: Discriminative Rule-based Knowledge for Relieving Prediction Confusions in Few-shot Relation Extraction
COLING 2022