Yige Li
13 papers · 2021–2026 · 7 conferences · across top CS/AI conferences
Achievements
Jump to papers ↓+7 more ↓ Show less ↑
🐝 Cross-Pollinator (12) 🌉 Interdisciplinary Bridge 🧭 Keyword Pioneer 🌍 Conference Polyglot (6) 🏃 Academic Marathon (5)
🌉
Interdisciplinary Bridge
🧭
Keyword Pioneer
🌍
Conference Polyglot
(6)
🏆
Grand Slam
⚡
Prolific Year
(8)
❓
The Questioner
💎
Century Club
(12)
Conferences
EMNLP (3)
ICLR (3)
ICML (3)
AAAI (1)
CVPR (1)
EACL (1)
NIPS (1)
Top co-authors
Research topics
Keywords
large language model
(3)
backdoor attack
(3)
vision-language model
(2)
adversarial learning
(2)
adversarial attack
(2)
backdoor defense
(2)
safety alignment
(1)
model safety
(1)
feature extraction
(1)
model behavior
(1)
model unlearning
(1)
machine unlearning
(1)
neural network security
(1)
model alignment
(1)
multimodal learning
(1)
empirical evaluation
(1)
approximation error
(1)
self-supervised learning
(1)
deep neural network
(1)
model security
(1)
Papers
Unleashing the Unseen: Harnessing Benign Datasets for Jailbreaking Large Language Models
EACL 2026
Anyattack: Towards Large-scale Self-supervised Adversarial Attacks on Vision-language Models
CVPR 2025
X-Transfer Attacks: Towards Super Transferable Adversarial Attacks on CLIP
ICML 2025
BlueSuffix: Reinforced Blue Teaming for Vision-Language Models Against Jailbreak Attacks
ICLR 2025
CROW: Eliminating Backdoors from Large Language Models via Internal Consistency Regularization
ICML 2025
Zero-Shot Defense Against Toxic Images via Inherent Multimodal Alignment in LVLMs
EMNLP 2025
Do Influence Functions Work on Large Language Models?
EMNLP 2025
Backdoor Token Unlearning: Exposing and Defending Backdoors in Pretrained Language Models
AAAI 2025
Detecting Backdoor Samples in Contrastive Language Image Pretraining
ICLR 2025
Defending Large Language Models Against Jailbreak Attacks via Layer-specific Editing
EMNLP 2024
Reconstructive Neuron Pruning for Backdoor Defense
ICML 2023
Neural Attention Distillation: Erasing Backdoor Triggers from Deep Neural Networks
ICLR 2021
Anti-Backdoor Learning: Training Clean Models on Poisoned Data
NIPS 2021