conftrace
Zeen Zhu
1 paper · 2026 · 1 conference · across top CS/AI conferences
Conferences
ACL (1)
Top co-authors
Jing Li (1)
Zesheng Shi (1)
Min Zhang (1)
Weiyang Guo (1)
Yuan Zhou (1)
Keywords
backdoor attack (1)
jailbreak attack (1)
harmful response (1)
reinforcement learning with verifiable reward (1)
asymmetric chain backdoor (1)
poisoning datum (1)
Papers
Backdoors in RLVR: Jailbreak Backdoors in LLMs From Verifiable Reward
ACL 2026