conftrace_

Yige Li

13 papers · 2021–2026 · 7 conferences · across top CS/AI conferences

Achievements

Jump to papers ↓

+7 more ↓

🐝 Cross-Pollinator (12) 🌉 Interdisciplinary Bridge 🧭 Keyword Pioneer 🌍 Conference Polyglot (6) 🏃 Academic Marathon (5)

🌉 Interdisciplinary Bridge 🧭 Keyword Pioneer 🌍 Conference Polyglot (6) 🏆 Grand Slam ⚡ Prolific Year (8) ❓ The Questioner 💎 Century Club (12)

Conferences

EMNLP (3) ICLR (3) ICML (3) AAAI (1) CVPR (1) EACL (1) NIPS (1)

Top co-authors

Xingjun Ma (7) Jun Sun (5) Xixiang Lyu (4) Wei Zhao (4) Zhe Li (4) Bo Li (3) Nodens Koren (3) Lingjuan Lyu (3) Yu-Gang Jiang (2) Sarah Monazam Erfani (2)

Research topics

Keywords

large language model (3) backdoor attack (3) vision-language model (2) adversarial learning (2) adversarial attack (2) backdoor defense (2) safety alignment (1) model safety (1) feature extraction (1) model behavior (1) model unlearning (1) machine unlearning (1) neural network security (1) model alignment (1) multimodal learning (1) empirical evaluation (1) approximation error (1) self-supervised learning (1) deep neural network (1) model security (1)

Papers

Unleashing the Unseen: Harnessing Benign Datasets for Jailbreaking Large Language Models EACL 2026 Anyattack: Towards Large-scale Self-supervised Adversarial Attacks on Vision-language Models CVPR 2025 X-Transfer Attacks: Towards Super Transferable Adversarial Attacks on CLIP ICML 2025 BlueSuffix: Reinforced Blue Teaming for Vision-Language Models Against Jailbreak Attacks ICLR 2025 CROW: Eliminating Backdoors from Large Language Models via Internal Consistency Regularization ICML 2025 Zero-Shot Defense Against Toxic Images via Inherent Multimodal Alignment in LVLMs EMNLP 2025 Do Influence Functions Work on Large Language Models? EMNLP 2025 Backdoor Token Unlearning: Exposing and Defending Backdoors in Pretrained Language Models AAAI 2025 Detecting Backdoor Samples in Contrastive Language Image Pretraining ICLR 2025 Defending Large Language Models Against Jailbreak Attacks via Layer-specific Editing EMNLP 2024 Reconstructive Neuron Pruning for Backdoor Defense ICML 2023 Neural Attention Distillation: Erasing Backdoor Triggers from Deep Neural Networks ICLR 2021 Anti-Backdoor Learning: Training Clean Models on Poisoned Data NIPS 2021