Lirong Qiu
4 papers · 2024–2026 · 4 conferences · across top CS/AI conferences
Achievements
Jump to papers ↓
🌍
Conference Polyglot
(2)
🌉
Interdisciplinary Bridge
🧭
Keyword Pioneer
🐝
Cross-Pollinator
(15)
Conferences
AAAI (1)
ACL (1)
EMNLP (1)
IJCAI (1)
Top co-authors
Keywords
jailbreak attack
(2)
adversarial learning
(1)
adversarial attack
(1)
supervised fine-tuning
(1)
jailbreak defense
(1)
large language model
(1)
adaptive defense
(1)
prompt calibration
(1)
intention shift
(1)
mirror crafting
(1)
semantic safety
(1)
entropy guidance
(1)
reinforcement learning
(1)
cognitive defense
(1)
Papers
MirrorShield: Towards Dynamic Adaptive Defense Against Jailbreaks via Entropy-Guided Mirror Crafting
AAAI 2026
Beyond Surface-Level Detection: Towards Cognitive-Driven Defense Against Jailbreak Attacks via Meta-Operations Reasoning
ACL 2026
Feint and Attack: Jailbreaking and Protecting LLMs via Attention Distribution Modeling
IJCAI 2025
BaitAttack: Alleviating Intention Shift in Jailbreak Attacks via Adaptive Bait Crafting
EMNLP 2024