conftrace_

Yuanpu Cao

12 papers · 2020–2026 · 7 conferences · across top CS/AI conferences

Achievements

Jump to papers ↓

+5 more ↓

🐣 Hot Topic Early Bird 🌉 Interdisciplinary Bridge 🧭 Keyword Pioneer 🌈 Renaissance Researcher (5) 🗺️ Taxonomy Completionist (20)

🌍 Conference Polyglot (7) 🏃 Academic Marathon (5) 🐝 Cross-Pollinator (6) 💎 Century Club (10) ⚡ Prolific Year (5)

Conferences

ACL (4) ICML (2) NAACL (2) EMNLP (1) ICLR (1) IJCAI (1) NIPS (1)

Top co-authors

Jinghui Chen (11) Bochuan Cao (6) Lu Lin (5) Fenglong Ma (4) Ziyi Yin (4) Ting Wang (2) Tianrong Zhang (2) Junyu Guo (1) Xia Hu (1) Yaopei Zeng (1)

Keywords

large language model (4) adversarial attack (3) ai safety (2) model alignment (2) multimodal large language model (2) policy learning (1) preference optimization (1) model security (1) harmful content (1) jailbreak attack (1) safety alignment (1) chain-of-thought reasoning (1) backdoor attack (1) model safety (1) hallucination mitigation (1) language model alignment (1) jailbreaking attack (1) safety evaluation (1) knowledge editing (1) adversarial robustness (1)

Papers

Can Factual Opinions Be Edited (Manipulated) in Large Language Models? ACL 2026 ICDAGENT: Empowering Agentic Large Language Models for Explainable Medical Coding ACL 2026 Phi: Preference Hijacking in Multi-modal Large Language Models at Inference Time EMNLP 2025 TruthFlow: Truthful LLM Generation via Representation Flow Correction ICML 2025 AdvI2I: Adversarial Image Attack on Image-to-Image Diffusion Models ICML 2025 Shadow-Activated Backdoor Attacks on Multimodal Large Language Models ACL 2025 WordGame: Efficient & Effective LLM Jailbreak via Simultaneous Obfuscation in Query and Response NAACL 2025 Stealthy and Persistent Unalignment on Large Language Models via Backdoor Injections NAACL 2024 Defending Against Alignment-Breaking Attacks via Robustly Aligned LLM ACL 2024 Tackling the Data Heterogeneity in Asynchronous Federated Learning with Cached Update Calibration ICLR 2024 Personalized Steering of Large Language Models: Versatile Steering Vectors Through Bi-directional Preference Optimization NIPS 2024 RLCard: A Platform for Reinforcement Learning in Card Games IJCAI 2020