Co-occurring keywords
Papers
Lose Your Self (LoYS): An Adversarial Entropy-based Unsupervised Approach for Model Debiasing
WACV 2026
Attack the Messages, Not the Agents: A Multi-round Adaptive Stealthy Tampering Framework for LLM-MAS
AAAI 2026
When Prompt Optimization Becomes Jailbreaking: Adaptive Red-Teaming of Large Language Models
EACL 2026
Toward the Frontiers of Reliable Diffusion Sampling via Adversarial Sinkhorn Attention Guidance
AAAI 2026
Silent Branding Attack: Trigger-free Data Poisoning Attack on Text-to-Image Diffusion Models
CVPR 2025
Generative Adversarial Diffusion
ICCV 2025