conftrace_

Zidi Xiong

9 papers · 2023–2026 · 5 top CS/AI conferences

Achievements

🌉 Interdisciplinary Bridge 🧭 Keyword Pioneer 🌍 Conference Polyglot (4) 🐝 Cross-Pollinator (10) 🗺️ Taxonomy Completionist (10)

👥 Mega-Team (25)

Conferences

ICML (3) · ICLR (2) · NIPS (2) · ACL (1) · EMNLP (1)

Top co-authors

Bo Li (7) · Zhen Xiang (5) · Dawn Song (4) · Chulin Xie (3) · Chejian Xu (2) · Zhuowen Yuan (2) · Zinan Lin (2) · Dan Hendrycks (2) · Jiawei Zhang (2) · Yi Zeng (2)

Keywords

backdoor attack (2) · conformal prediction (1) · probabilistic modeling (1) · anomaly detection (1) · adversarial machine learning (1) · toxicity detection (1) · error propagation (1) · multilingual reasoning (1) · experience replay (1) · clustering approach (1) · memory management (1) · certified defense (1) · llm agent (1) · large reasoning model (1) · unsupervised model detection (1) · adversarial target (1) · large language model (1) · reasoning accuracy (1) · language mismatch (1) · trustworthiness evaluation (1)

Papers

How Memory Management Impacts LLM Agents: An Empirical Study of Experience-Following Behavior · ACL 2026
When Models Reason in Your Language: Controlling Thinking Language Comes at the Cost of Accuracy · EMNLP 2025
MMDT: Decoding the Trustworthiness and Safety of Multimodal Foundation Models · ICLR 2025
GuardAgent: Safeguard LLM Agents via Knowledge-Enabled Reasoning · ICML 2025
RigorLLM: Resilient Guardrails for Large Language Models against Undesired Content · ICML 2024
BadChain: Backdoor Chain-of-Thought Prompting for Large Language Models · ICLR 2024
CBD: A Certified Backdoor Detector Based on Local Dominant Probability · NIPS 2023
UMD: Unsupervised Model Detection for X2X Backdoor Attacks · ICML 2023
DecodingTrust: A Comprehensive Assessment of Trustworthiness in GPT Models · NIPS 2023