conftrace_

Zeen Zhu

1 papers · 2026–2026 · 1 conference · across top CS/AI conferences

Conferences

ACL (1)

Top co-authors

Jing Li (1) Zesheng Shi (1) Min Zhang (1) Weiyang Guo (1) Yuan Zhou (1)

Keywords

backdoor attack (1) jailbreak attack (1) harmful response (1) reinforcement learning with verifiable reward (1) asymmetric chain backdoor (1) poisoning datum (1)

Papers

Backdoors in RLVR: Jailbreak Backdoors in LLMs From Verifiable Reward ACL 2026