Zheli Liu
5 papers · 2024–2026 · 3 conferences · across top CS/AI conferences
Achievements
Jump to papers ↓+1 more ↓ Show less ↑
π Interdisciplinary Bridge π§ Keyword Pioneer π Conference Polyglot (3) π Cross-Pollinator (13) π Renaissance Researcher (5)
πΊοΈ
Taxonomy Completionist
(10)
Conferences
ACL (3)
EMNLP (1)
ICLR (1)
Top co-authors
Keywords
model extraction attack
(1)
backdoor defense
(1)
supervised learning
(1)
safety alignment
(1)
backdoor attack
(1)
model collapse
(1)
copyright protection
(1)
watermark detection
(1)
fine-tuning attack
(1)
hallucination detection
(1)
harmful fine-tuning
(1)
cross-domain generalization
(1)
internal state
(1)
embedding watermarking
(1)
copyright infringement
(1)
selective unlearning
(1)
activation space
(1)
large language model
(1)
backdoor watermarking
(1)
semantic perturbation
(1)
Papers
CTRAP: Embedding Collapse Trap to Safeguard Large Language Models from Harmful Fine-Tuning
ACL 2026
Prompt-Guided Internal States for Hallucination Detection of Large Language Models
ACL 2025
Your Semantic-Independent Watermark is Fragile: A Semantic Perturbation Attack against EaaS Watermark
EMNLP 2025
Probe before You Talk: Towards Black-box Defense against Backdoor Unalignment for Large Language Models
ICLR 2025
BadActs: A Universal Backdoor Defense in the Activation Space
ACL 2024