Co-occurring keywords
Papers
When Prompt Optimization Becomes Jailbreaking: Adaptive Red-Teaming of Large Language Models
EACL 2026
BeDKD: Backdoor Defense Based on Directional Mapping Module and Adversarial Knowledge Distillation
AAAI 2026