Fengqing Jiang
11 papers · 2024–2026 · 5 conferences · across top CS/AI conferences
Achievements
Jump to papers ↓+3 more ↓ Show less ↑
π£ Hot Topic Early Bird π Interdisciplinary Bridge π§ Keyword Pioneer π Conference Polyglot (5) π Cross-Pollinator (5)
πΊοΈ
Taxonomy Completionist
(17)
π
Keyword Champion
(2)
β‘
Prolific Year
(5)
Conferences
ACL (6)
ICLR (2)
AAAI (1)
EMNLP (1)
NAACL (1)
Top co-authors
Keywords
large language model
(8)
jailbreak attack
(3)
safety alignment
(3)
adversarial attack
(2)
instruction tuning
(2)
knowledge distillation
(2)
chain-of-thought reasoning
(2)
decoding strategy
(2)
llm safety
(2)
adversarial learning
(2)
model safety
(1)
safety evaluation
(1)
adversarial training
(1)
adversarial prompt
(1)
reasoning trace
(1)
backdoor attack
(1)
reasoning benchmark
(1)
token probability
(1)
reasoning model
(1)
harmful content
(1)
Papers
Temporal Sampling for Forgotten Reasoning in LLMs
ACL 2026
BadScientist: Can a Research Agent Write Convincing but Unsound Papers that Fool LLM Reviewers?
ACL 2026
Small Models Struggle to Learn from Strong Reasoners
ACL 2025
ChatBug: A Common Vulnerability of Aligned LLMs Induced by Chat Templates
AAAI 2025
SafeChain: Safety of Language Models with Long Chain-of-Thought Reasoning Capabilities
ACL 2025
Magpie: Alignment Data Synthesis from Scratch by Prompting Aligned LLMs with Nothing
ICLR 2025
Stronger Models are Not Always Stronger Teachers for Instruction Tuning
NAACL 2025
ArtPrompt: ASCII Art-based Jailbreak Attacks against Aligned LLMs
ACL 2024
SafeDecoding: Defending against Jailbreak Attacks via Safety-Aware Decoding
ACL 2024
CleanGen: Mitigating Backdoor Attacks for Generation Tasks in Large Language Models
EMNLP 2024
BadChain: Backdoor Chain-of-Thought Prompting for Large Language Models
ICLR 2024