Zhangchen Xu
10 papers · 2024–2026 · 5 conferences · across top CS/AI conferences
Achievements
Jump to papers ↓+2 more ↓ Show less ↑
π Conference Polyglot (5) π Interdisciplinary Bridge π§ Keyword Pioneer π£ Hot Topic Early Bird π Cross-Pollinator (5)
πΊοΈ
Taxonomy Completionist
(21)
β‘
Prolific Year
(6)
Conferences
ACL (6)
AAAI (1)
EMNLP (1)
ICLR (1)
NAACL (1)
Top co-authors
Keywords
large language model
(8)
safety alignment
(3)
jailbreak attack
(3)
adversarial attack
(2)
instruction tuning
(2)
adversarial learning
(2)
decoding strategy
(2)
llm safety
(2)
knowledge distillation
(2)
chain-of-thought reasoning
(2)
code generation
(1)
backdoor attack
(1)
text generation
(1)
synthetic datum
(1)
harmful content
(1)
adversarial prompt
(1)
model safety
(1)
safety evaluation
(1)
supervised fine-tuning
(1)
adversarial training
(1)
Papers
Temporal Sampling for Forgotten Reasoning in LLMs
ACL 2026
Small Models Struggle to Learn from Strong Reasoners
ACL 2025
ChatBug: A Common Vulnerability of Aligned LLMs Induced by Chat Templates
AAAI 2025
Magpie: Alignment Data Synthesis from Scratch by Prompting Aligned LLMs with Nothing
ICLR 2025
Stronger Models are Not Always Stronger Teachers for Instruction Tuning
NAACL 2025
KodCode: A Diverse, Challenging, and Verifiable Synthetic Dataset for Coding
ACL 2025
SafeChain: Safety of Language Models with Long Chain-of-Thought Reasoning Capabilities
ACL 2025
ArtPrompt: ASCII Art-based Jailbreak Attacks against Aligned LLMs
ACL 2024
SafeDecoding: Defending against Jailbreak Attacks via Safety-Aware Decoding
ACL 2024
CleanGen: Mitigating Backdoor Attacks for Generation Tasks in Large Language Models
EMNLP 2024