conftrace_

Zeming Wei

7 papers · 2023–2025 · 3 conferences · across top CS/AI conferences

Achievements

🌉 Interdisciplinary Bridge 🧭 Keyword Pioneer 🌍 Conference Polyglot (3) 🐝 Cross-Pollinator (12) 🗺️ Taxonomy Completionist (22)

🐣 Hot Topic Early Bird

Conferences

NIPS (4) ICML (2) CVPR (1)

Top co-authors

Yisen Wang (5) Yifei Wang (4) Yihao Zhang (2) Xiaojun Guo (1) Huanran Chen (1) Stefanie Jegelka (1) Jun Sun (1) Hangzhou He (1) Meng Sun (1) Yiwen Guo (1)

Keywords

large language model (3) adversarial training (2) adversarial robustness (2) self-supervised learning (1) in-context learning (1) model safety (1) model editing (1) safety alignment (1) inductive bias (1) node classification (1) representation engineering (1) jailbreaking attack (1) weight averaging (1) prompt optimization (1) softmax attention (1) multi-head attention (1) reward mechanism (1) graph contrastive learning (1) concept editing (1) llm security (1)

Papers

Identifying and Understanding Cross-Class Features in Adversarial Training · ICML 2025
Fight Back Against Jailbreaking via Prompt Adversarial Tuning · NIPS 2024
A Theoretical Understanding of Self-Correction through In-context Alignment · NIPS 2024
Adversarial Representation Engineering: A General Model Editing Framework for Large Language Models · NIPS 2024
On the Duality Between Sharpness-Aware Minimization and Adversarial Training · ICML 2024
Architecture Matters: Uncovering Implicit Mechanisms in Graph Contrastive Learning · NIPS 2023
CFA: Class-Wise Calibrated Fair Adversarial Training · CVPR 2023