Zhengwei Fang
4 papers · 2024–2025 · 4 conferences · across top CS/AI conferences
Achievements
Interdisciplinary Bridge
Keyword Pioneer
Conference Polyglot (4)
Cross-Pollinator (11)
Taxonomy Completionist (13)
Conferences
CVPR (1)
ICML (1)
NAACL (1)
NeurIPS (1)
Keywords
adversarial attack (2)
black-box optimization (1)
ensemble learning (1)
posterior distribution (1)
black-box attack (1)
asymptotic normality (1)
multimodal large language model (1)
jailbreak attack (1)
adversarial prompt (1)
prompt optimization (1)
privacy leakage (1)
trustworthiness benchmark (1)
trustworthy AI (1)
security vulnerabilities (1)
large language model (1)
trustworthiness evaluation (1)
multimodal bias (1)
trustworthy multimodal large language model (1)
multimodal trustworthiness benchmark (1)
multimodal jailbreaking (1)
Papers
STAIR: Improving Safety Alignment with Introspective Reasoning
ICML 2025
AutoBreach: Universal and Adaptive Jailbreaking with Efficient Wordplay-Guided Optimization via Multi-LLMs
NAACL 2025
MultiTrust: A Comprehensive Benchmark Towards Trustworthy Multimodal Large Language Models
NeurIPS 2024
Strong Transferable Adversarial Attacks via Ensembled Asymptotically Normal Distribution Learning
CVPR 2024