Xuehai Tang
7 papers · 2025–2026 · 3 conferences · across top CS/AI conferences
Achievements
Conference Polyglot (2) · Interdisciplinary Bridge · Taxonomy Completionist (12) · Keyword Pioneer · Cross-Pollinator (15) · Rising Star (5)
Conferences
ACL (3)
AAAI (2)
EMNLP (2)
Top co-authors
Keywords
jailbreak attack (4)
large language model (4)
safety alignment (2)
chain-of-thought reasoning (1)
model safety (1)
model adaptation (1)
model editing (1)
constrained optimization (1)
model alignment (1)
backdoor attack (1)
adversarial attack (1)
adversarial defense (1)
lyapunov optimization (1)
low-rank adaptation (1)
red teaming (1)
cognitive bias (1)
knowledge preservation (1)
vulnerability identification (1)
attack success rate (1)
sequential editing (1)
Papers
Exploiting Synergistic Cognitive Biases to Bypass Safety in LLMs
AAAI 2026
Hidden in the Noise: Unveiling Backdoors in Audio LLMs Alignment Through Latent Acoustic Pattern Triggers
AAAI 2026
Resolving the Security-Auditability Dilemma with Auditable Latent Chain-of-Thought Alignment
ACL 2026
More Thinking, Less Talking: Internalizing Deliberative Safety into LLM Parameters
ACL 2026
LyapLock: Bounded Knowledge Preservation in Sequential Large Language Model Editing
EMNLP 2025
Gamma-Guard: Lightweight Residual Adapters for Robust Guardrails in Large Language Models
EMNLP 2025
Chain of Attack: Hide Your Intention through Multi-Turn Interrogation
ACL 2025