conftrace_

Xuehai Tang

7 papers · 2025–2026 · 3 conferences · across top CS/AI conferences

Achievements

Jump to papers ↓

+1 more ↓

🌍 Conference Polyglot (2) 🌉 Interdisciplinary Bridge 🗺️ Taxonomy Completionist (12) 🧭 Keyword Pioneer 🐝 Cross-Pollinator (15)

⭐ Rising Star (5)

Conferences

ACL (3) AAAI (2) EMNLP (2)

Top co-authors

Songlin Hu (6) Jizhong Han (6) Biyu Zhou (5) Guan Wang (3) Xikang Yang (3) Kaiwen Luo (1) Wen Jie (1) Yibo Zhang (1) Lilan Peng (1) Kun Wang (1)

Keywords

jailbreak attack (4) large language model (4) safety alignment (2) chain-of-thought reasoning (1) model safety (1) model adaptation (1) model editing (1) constrained optimization (1) model alignment (1) backdoor attack (1) adversarial attack (1) adversarial defense (1) lyapunov optimization (1) low-rank adaptation (1) red teaming (1) cognitive bia (1) knowledge preservation (1) vulnerability identification (1) attack success rate (1) sequential editing (1)

Papers

Exploiting Synergistic Cognitive Biases to Bypass Safety in LLMs AAAI 2026 Hidden in the Noise: Unveiling Backdoors in Audio LLMs Alignment Through Latent Acoustic Pattern Triggers AAAI 2026 Resolving the Security-Auditability Dilemma with Auditable Latent Chain-of-Thought Alignment ACL 2026 More Thinking, Less Talking: Internalizing Deliberative Safety into LLM Parameters ACL 2026 LyapLock: Bounded Knowledge Preservation in Sequential Large Language Model Editing EMNLP 2025 Gamma-Guard: Lightweight Residual Adapters for Robust Guardrails in Large Language Models EMNLP 2025 Chain of Attack: Hide Your Intention through Multi-Turn Interrogation ACL 2025