Zixuan Weng
3 papers · 2025–2026 · 3 conferences · across top CS/AI conferences
Achievements
🌍 Conference Polyglot (2)
🌉 Interdisciplinary Bridge
🐝 Cross-Pollinator (15)
Conferences
ACL (1)
EMNLP (1)
ICCV (1)
Keywords
jailbreak attack (2)
safety alignment (1)
adversarial attack (1)
diffusion model (1)
mixture of expert (1)
multi-turn interaction (1)
safety evaluation (1)
adversarial prompt (1)
safety benchmark (1)
representation steering (1)
inference-time steering (1)
toxic response (1)
language model alignment (1)
large language model (1)