Co-occurring keywords
Papers
SABER: Uncovering Vulnerabilities in Safety Alignment via Cross-Layer Residual Connection
EMNLP 2025
Safety in Large Reasoning Models: A Survey
EMNLP 2025
Obliviate: Neutralizing Task-agnostic Backdoors within the Parameter-efficient Fine-tuning Paradigm
NAACL 2025