conftrace_

Zhangchen Xu

10 papers · 2024–2026 · 5 conferences · across top CS/AI conferences

Achievements

🌍 Conference Polyglot (5)
🌉 Interdisciplinary Bridge
🧭 Keyword Pioneer
🐣 Hot Topic Early Bird
🐝 Cross-Pollinator (5)
🗺️ Taxonomy Completionist (21)
⚡ Prolific Year (6)

Conferences

ACL (6) AAAI (1) EMNLP (1) ICLR (1) NAACL (1)

Top co-authors

Radha Poovendran (10) Luyao Niu (9) Fengqing Jiang (9) Bill Yuchen Lin (7) Bhaskar Ramasubramanian (4) Yuetai Li (4) Zhen Xiang (2) Bo Li (2) Xiang Yue (2) Yang Liu (1)

Keywords

large language model (8) safety alignment (3) jailbreak attack (3) adversarial attack (2) instruction tuning (2) adversarial learning (2) decoding strategy (2) llm safety (2) knowledge distillation (2) chain-of-thought reasoning (2) code generation (1) backdoor attack (1) text generation (1) synthetic data (1) harmful content (1) adversarial prompt (1) model safety (1) safety evaluation (1) supervised fine-tuning (1) adversarial training (1)

Papers

Temporal Sampling for Forgotten Reasoning in LLMs · ACL 2026
Small Models Struggle to Learn from Strong Reasoners · ACL 2025
ChatBug: A Common Vulnerability of Aligned LLMs Induced by Chat Templates · AAAI 2025
Magpie: Alignment Data Synthesis from Scratch by Prompting Aligned LLMs with Nothing · ICLR 2025
Stronger Models are Not Always Stronger Teachers for Instruction Tuning · NAACL 2025
KodCode: A Diverse, Challenging, and Verifiable Synthetic Dataset for Coding · ACL 2025
SafeChain: Safety of Language Models with Long Chain-of-Thought Reasoning Capabilities · ACL 2025
ArtPrompt: ASCII Art-based Jailbreak Attacks against Aligned LLMs · ACL 2024
SafeDecoding: Defending against Jailbreak Attacks via Safety-Aware Decoding · ACL 2024
CleanGen: Mitigating Backdoor Attacks for Generation Tasks in Large Language Models · EMNLP 2024