Zeming Wei
7 papers · 2023–2025 · 3 conferences · across top CS/AI conferences
Achievements
Jump to papers ↓+1 more ↓ Show less ↑
π Interdisciplinary Bridge π§ Keyword Pioneer π Conference Polyglot (3) π Cross-Pollinator (12) πΊοΈ Taxonomy Completionist (22)
π£
Hot Topic Early Bird
Conferences
NIPS (4)
ICML (2)
CVPR (1)
Top co-authors
Keywords
large language model
(3)
adversarial training
(2)
adversarial robustness
(2)
self-supervised learning
(1)
in-context learning
(1)
model safety
(1)
model editing
(1)
safety alignment
(1)
inductive bia
(1)
node classification
(1)
representation engineering
(1)
jailbreaking attack
(1)
weight averaging
(1)
prompt optimization
(1)
softmax attention
(1)
multi-head attention
(1)
reward mechanism
(1)
graph contrastive learning
(1)
concept editing
(1)
llm security
(1)
Papers
Identifying and Understanding Cross-Class Features in Adversarial Training
ICML 2025
Fight Back Against Jailbreaking via Prompt Adversarial Tuning
NIPS 2024
A Theoretical Understanding of Self-Correction through In-context Alignment
NIPS 2024
Adversarial Representation Engineering: A General Model Editing Framework for Large Language Models
NIPS 2024
On the Duality Between Sharpness-Aware Minimization and Adversarial Training
ICML 2024
Architecture Matters: Uncovering Implicit Mechanisms in Graph Contrastive Learning
NIPS 2023
CFA: Class-Wise Calibrated Fair Adversarial Training
CVPR 2023