conftrace_

Fengqing Jiang

11 papers · 2024–2026 · 5 conferences · across top CS/AI conferences

Achievements


🐣 Hot Topic Early Bird
🌉 Interdisciplinary Bridge
🧭 Keyword Pioneer
🌍 Conference Polyglot (5)
🐝 Cross-Pollinator (5)
🗺️ Taxonomy Completionist (17)
🏆 Keyword Champion (2)
⚡ Prolific Year (5)

Conferences

ACL (6) · ICLR (2) · AAAI (1) · EMNLP (1) · NAACL (1)

Top co-authors

Radha Poovendran (11) · Luyao Niu (10) · Zhangchen Xu (9) · Bill Yuchen Lin (7) · Bhaskar Ramasubramanian (5) · Yuetai Li (5) · Zhen Xiang (3) · Bo Li (3) · Xiang Yue (2) · Yuntian Deng (1)

Keywords

large language model (8) · jailbreak attack (3) · safety alignment (3) · adversarial attack (2) · instruction tuning (2) · knowledge distillation (2) · chain-of-thought reasoning (2) · decoding strategy (2) · llm safety (2) · adversarial learning (2) · model safety (1) · safety evaluation (1) · adversarial training (1) · adversarial prompt (1) · reasoning trace (1) · backdoor attack (1) · reasoning benchmark (1) · token probability (1) · reasoning model (1) · harmful content (1)

Papers

Temporal Sampling for Forgotten Reasoning in LLMs · ACL 2026
BadScientist: Can a Research Agent Write Convincing but Unsound Papers that Fool LLM Reviewers? · ACL 2026
Small Models Struggle to Learn from Strong Reasoners · ACL 2025
ChatBug: A Common Vulnerability of Aligned LLMs Induced by Chat Templates · AAAI 2025
SafeChain: Safety of Language Models with Long Chain-of-Thought Reasoning Capabilities · ACL 2025
Magpie: Alignment Data Synthesis from Scratch by Prompting Aligned LLMs with Nothing · ICLR 2025
Stronger Models are Not Always Stronger Teachers for Instruction Tuning · NAACL 2025
ArtPrompt: ASCII Art-based Jailbreak Attacks against Aligned LLMs · ACL 2024
SafeDecoding: Defending against Jailbreak Attacks via Safety-Aware Decoding · ACL 2024
CleanGen: Mitigating Backdoor Attacks for Generation Tasks in Large Language Models · EMNLP 2024
BadChain: Backdoor Chain-of-Thought Prompting for Large Language Models · ICLR 2024