Yuanpu Cao
12 papers · 2020–2026 · 7 conferences · across top CS/AI conferences
Achievements
Jump to papers ↓+5 more ↓ Show less ↑
π£ Hot Topic Early Bird π Interdisciplinary Bridge π§ Keyword Pioneer π Renaissance Researcher (5) πΊοΈ Taxonomy Completionist (20)
π
Conference Polyglot
(7)
π
Academic Marathon
(5)
π
Cross-Pollinator
(6)
π
Century Club
(10)
β‘
Prolific Year
(5)
Conferences
ACL (4)
ICML (2)
NAACL (2)
EMNLP (1)
ICLR (1)
IJCAI (1)
NIPS (1)
Top co-authors
Keywords
large language model
(4)
adversarial attack
(3)
ai safety
(2)
model alignment
(2)
multimodal large language model
(2)
policy learning
(1)
preference optimization
(1)
model security
(1)
harmful content
(1)
jailbreak attack
(1)
safety alignment
(1)
chain-of-thought reasoning
(1)
backdoor attack
(1)
model safety
(1)
hallucination mitigation
(1)
language model alignment
(1)
jailbreaking attack
(1)
safety evaluation
(1)
knowledge editing
(1)
adversarial robustness
(1)
Papers
Can Factual Opinions Be Edited (Manipulated) in Large Language Models?
ACL 2026
ICDAGENT: Empowering Agentic Large Language Models for Explainable Medical Coding
ACL 2026
Phi: Preference Hijacking in Multi-modal Large Language Models at Inference Time
EMNLP 2025
TruthFlow: Truthful LLM Generation via Representation Flow Correction
ICML 2025
AdvI2I: Adversarial Image Attack on Image-to-Image Diffusion Models
ICML 2025
Shadow-Activated Backdoor Attacks on Multimodal Large Language Models
ACL 2025
WordGame: Efficient & Effective LLM Jailbreak via Simultaneous Obfuscation in Query and Response
NAACL 2025
Stealthy and Persistent Unalignment on Large Language Models via Backdoor Injections
NAACL 2024
Defending Against Alignment-Breaking Attacks via Robustly Aligned LLM
ACL 2024
Tackling the Data Heterogeneity in Asynchronous Federated Learning with Cached Update Calibration
ICLR 2024
Personalized Steering of Large Language Models: Versatile Steering Vectors Through Bi-directional Preference Optimization
NIPS 2024
RLCard: A Platform for Reinforcement Learning in Card Games
IJCAI 2020