Zhenting Wang

28 papers · 2022–2026 · 9 conferences · across top CS/AI conferences

Achievements

+8 more ↓

🗺️ Taxonomy Completionist (34) 🌉 Interdisciplinary Bridge 🌈 Renaissance Researcher (8) 🌍 Conference Polyglot (9) 🧭 Keyword Pioneer

🧭 Keyword Pioneer 🐣 Hot Topic Early Bird 🤝 Dynamic Duo (14) ❓ The Questioner (4) ⚡ Prolific Year (16) 💎 Century Club (27) 🔥 Unstoppable (5) 🗃️ Keyword Collector (93)

Conferences

CVPR (6) ICLR (5) ACL (4) ICML (4) NAACL (3) NIPS (3) COLING (1) ECCV (1) WACV (1)

Top co-authors

Shiqing Ma (14) Lingjuan Lyu (7) Dimitris N. Metaxas (7) Juan Zhai (6) Yongfeng Zhang (5) Mingyu Jin (5) Kai Mei (5) Shiyu Zhao (4) Chen Chen (4) Xiangyu Zhang (4)

Research topics

Privacy (4) Resources & Methods (1)

Keywords

backdoor attack (6) adversarial attack (4) neural network (4) large language model (4) trojan attack (3) adversarial learning (3) neural network security (3) chain-of-thought reasoning (2) model security (2) image classification (2) backdoor defense (2) backdoor detection (2) multimodal large language model (2) reverse engineering (2) ai-generated content (1) prompt engineering (1) in-context learning (1) zero-shot learning (1) model safety (1) self-supervised learning (1)

Papers

Mitigating Backdoor Attacks via Trigger Reconstruction and Model Hardening WACV 2026 Reasoning over Precedents Alongside Statutes: Case-Augmented Deliberative Alignment for LLM Safety ACL 2026 EmojiPrompt: Generative Prompt Obfuscation for Privacy-Preserving Communication with Cloud-based LLMs NAACL 2025 Token-Budget-Aware LLM Reasoning ACL 2025 ADO: Automatic Data Optimization for Inputs in LLM Prompts ACL 2025 Exploring Concept Depth: How Large Language Models Acquire Knowledge and Concept at Different Layers? COLING 2025 Data-centric NLP Backdoor Defense from the Lens of Memorization NAACL 2025 Invisible Backdoor Attack against Self-supervised Learning CVPR 2025 MLLM-as-a-Judge for Image Safety without Human Labeling CVPR 2025 Accelerating Multimodal Large Language Models by Searching Optimal Vision Token Reduction CVPR 2025 CO-SPY: Combining Semantic and Pixel Features to Detect Synthetic Images by AI CVPR 2025 The Hidden Life of Tokens: Reducing Hallucination of Large Vision-Language Models Via Visual Information Steering ICML 2025 ProSec: Fortifying Code LLMs with Proactive Security Alignment ICML 2025 Agent Security Bench (ASB): Formalizing and Benchmarking Attacks and Defenses in LLM-based Agents ICLR 2025 Visual Agents as Fast and Slow Thinkers ICLR 2025 How to Evaluate and Mitigate IP Infringement in Visual Generative AI? ICML 2025 An Optimizable Suffix Is Worth A Thousand Templates: Efficient Black-box Jailbreaking without Affirmative Phrases via LLM as Optimizer NAACL 2025 LoR-VP: Low-Rank Visual Prompting for Efficient Vision Model Adaptation ICLR 2025 Finding a needle in a haystack: A Black-Box Approach to Invisible Watermark Detection ECCV 2024 How to Trace Latent Generative Model Generated Images without Artificial Watermark? ICML 2024 DIAGNOSIS: Detecting Unauthorized Data Usages in Text-to-image Diffusion Models ICLR 2024 UNICORN: A Unified Backdoor Trigger Inversion Framework ICLR 2023 Where Did I Come From? Origin Attribution of AI-Generated Images NIPS 2023 NOTABLE: Transferable Backdoor Attacks Against Prompt-based NLP Models ACL 2023 Rethinking the Reverse-engineering of Trojan Triggers NIPS 2022 Complex Backdoor Detection by Symmetric Feature Differencing CVPR 2022 BppAttack: Stealthy and Efficient Trojan Attacks Against Deep Neural Networks via Image Quantization and Contrastive Adversarial Learning CVPR 2022 Training with More Confidence: Mitigating Injected and Natural Backdoors During Training NIPS 2022