Zheli Liu

5 papers · 2024–2026 · 3 conferences · across top CS/AI conferences

Achievements

🌉 Interdisciplinary Bridge
🧭 Keyword Pioneer
🌍 Conference Polyglot (3)
🐝 Cross-Pollinator (13)
🌈 Renaissance Researcher (5)
🗺️ Taxonomy Completionist (10)

Conferences

ACL (3) EMNLP (1) ICLR (1)

Top co-authors

Biao Yi (5) Tong Li (4) Baolei Zhang (3) Sishuo Chen (2) Lihai Nie (2) Tiansheng Huang (2) Yiming Li (2) Zhixuan Chu (1) Li Shen (1) Peiqi Yu (1)

Keywords

model extraction attack (1) backdoor defense (1) supervised learning (1) safety alignment (1) backdoor attack (1) model collapse (1) copyright protection (1) watermark detection (1) fine-tuning attack (1) hallucination detection (1) harmful fine-tuning (1) cross-domain generalization (1) internal state (1) embedding watermarking (1) copyright infringement (1) selective unlearning (1) activation space (1) large language model (1) backdoor watermarking (1) semantic perturbation (1)

Papers

CTRAP: Embedding Collapse Trap to Safeguard Large Language Models from Harmful Fine-Tuning · ACL 2026
Prompt-Guided Internal States for Hallucination Detection of Large Language Models · ACL 2025
Your Semantic-Independent Watermark is Fragile: A Semantic Perturbation Attack against EaaS Watermark · EMNLP 2025
Probe before You Talk: Towards Black-box Defense against Backdoor Unalignment for Large Language Models · ICLR 2025
BadActs: A Universal Backdoor Defense in the Activation Space · ACL 2024