Zhenting Wang
28 papers · 2022–2026 · 9 conferences · across top CS/AI conferences
Achievements
Taxonomy Completionist (34)
Interdisciplinary Bridge
Renaissance Researcher (8)
Conference Polyglot (9)
Keyword Pioneer
Hot Topic Early Bird
Dynamic Duo (14)
The Questioner (4)
Prolific Year (16)
Century Club (27)
Unstoppable (5)
Keyword Collector (93)
Conferences
CVPR (6)
ICLR (5)
ACL (4)
ICML (4)
NAACL (3)
NeurIPS (3)
COLING (1)
ECCV (1)
WACV (1)
Research topics
Keywords
backdoor attack (6)
adversarial attack (4)
neural network (4)
large language model (4)
trojan attack (3)
adversarial learning (3)
neural network security (3)
chain-of-thought reasoning (2)
model security (2)
image classification (2)
backdoor defense (2)
backdoor detection (2)
multimodal large language model (2)
reverse engineering (2)
ai-generated content (1)
prompt engineering (1)
in-context learning (1)
zero-shot learning (1)
model safety (1)
self-supervised learning (1)
Papers
Mitigating Backdoor Attacks via Trigger Reconstruction and Model Hardening
WACV 2026
Reasoning over Precedents Alongside Statutes: Case-Augmented Deliberative Alignment for LLM Safety
ACL 2026
EmojiPrompt: Generative Prompt Obfuscation for Privacy-Preserving Communication with Cloud-based LLMs
NAACL 2025
Token-Budget-Aware LLM Reasoning
ACL 2025
ADO: Automatic Data Optimization for Inputs in LLM Prompts
ACL 2025
Exploring Concept Depth: How Large Language Models Acquire Knowledge and Concept at Different Layers?
COLING 2025
Data-centric NLP Backdoor Defense from the Lens of Memorization
NAACL 2025
Invisible Backdoor Attack against Self-supervised Learning
CVPR 2025
MLLM-as-a-Judge for Image Safety without Human Labeling
CVPR 2025
Accelerating Multimodal Large Language Models by Searching Optimal Vision Token Reduction
CVPR 2025
CO-SPY: Combining Semantic and Pixel Features to Detect Synthetic Images by AI
CVPR 2025
The Hidden Life of Tokens: Reducing Hallucination of Large Vision-Language Models Via Visual Information Steering
ICML 2025
ProSec: Fortifying Code LLMs with Proactive Security Alignment
ICML 2025
Agent Security Bench (ASB): Formalizing and Benchmarking Attacks and Defenses in LLM-based Agents
ICLR 2025
Visual Agents as Fast and Slow Thinkers
ICLR 2025
How to Evaluate and Mitigate IP Infringement in Visual Generative AI?
ICML 2025
An Optimizable Suffix Is Worth A Thousand Templates: Efficient Black-box Jailbreaking without Affirmative Phrases via LLM as Optimizer
NAACL 2025
LoR-VP: Low-Rank Visual Prompting for Efficient Vision Model Adaptation
ICLR 2025
Finding a needle in a haystack: A Black-Box Approach to Invisible Watermark Detection
ECCV 2024
How to Trace Latent Generative Model Generated Images without Artificial Watermark?
ICML 2024
DIAGNOSIS: Detecting Unauthorized Data Usages in Text-to-image Diffusion Models
ICLR 2024
UNICORN: A Unified Backdoor Trigger Inversion Framework
ICLR 2023
Where Did I Come From? Origin Attribution of AI-Generated Images
NeurIPS 2023
NOTABLE: Transferable Backdoor Attacks Against Prompt-based NLP Models
ACL 2023
Rethinking the Reverse-engineering of Trojan Triggers
NeurIPS 2022
Complex Backdoor Detection by Symmetric Feature Differencing
CVPR 2022
BppAttack: Stealthy and Efficient Trojan Attacks Against Deep Neural Networks via Image Quantization and Contrastive Adversarial Learning
CVPR 2022
Training with More Confidence: Mitigating Injected and Natural Backdoors During Training
NeurIPS 2022