Weilong Dong
5 papers · 2023–2025 · 4 conferences · across top CS/AI conferences
Achievements
Cross-Pollinator (15)
Conference Polyglot (4)
Renaissance Researcher (5)
Interdisciplinary Bridge
Taxonomy Completionist (13)
Keyword Pioneer
Hot Topic Early Bird
The Questioner
Conferences
EMNLP (2)
ACL (1)
COLING (1)
NeurIPS (1)
Top co-authors
Research topics
Keywords
large language model (3)
value alignment (2)
neuron editing (2)
integrated gradient (2)
privacy neuron (2)
model editing (2)
privacy protection (1)
privacy leakage (1)
pretrained language model (1)
multilingual natural language processing (1)
parametric knowledge (1)
knowledge conflict (1)
activation patching (1)
context processing (1)
concept vector (1)
memorization capability (1)
activation value (1)
context-aware neuron (1)
model compression (1)
neuron reweighting (1)
Papers
CONTRANS: Weak-to-Strong Alignment Engineering via Concept Transplantation
COLING 2025
IRCAN: Mitigating Knowledge Conflicts in LLM Generation via Identifying and Reweighting Context-Aware Neurons
NeurIPS 2024
Mitigating Privacy Seesaw in Large Language Models: Augmented Privacy Neuron Editing via Activation Patching
ACL 2024
Exploring Multilingual Concepts of Human Values in Large Language Models: Is Value Alignment Consistent, Transferable and Controllable across Languages?
EMNLP 2024
DEPN: Detecting and Editing Privacy Neurons in Pretrained Language Models
EMNLP 2023