Zidi Xiong
9 papers · 2023–2026 · 5 conferences · across top CS/AI conferences
Achievements
Jump to papers ↓+1 more ↓ Show less ↑
π Interdisciplinary Bridge π§ Keyword Pioneer π Conference Polyglot (4) π Cross-Pollinator (10) πΊοΈ Taxonomy Completionist (10)
π₯
Mega-Team
(25)
Conferences
ICML (3)
ICLR (2)
NIPS (2)
ACL (1)
EMNLP (1)
Top co-authors
Keywords
backdoor attack
(2)
conformal prediction
(1)
probabilistic modeling
(1)
anomaly detection
(1)
adversarial machine learning
(1)
toxicity detection
(1)
error propagation
(1)
multilingual reasoning
(1)
experience replay
(1)
clustering approach
(1)
memory management
(1)
certified defense
(1)
llm agent
(1)
large reasoning model
(1)
unsupervised model detection
(1)
adversarial target
(1)
large language model
(1)
reasoning accuracy
(1)
language mismatch
(1)
trustworthiness evaluation
(1)
Papers
How Memory Management Impacts LLM Agents: An Empirical Study of Experience-Following Behavior
ACL 2026
When Models Reason in Your Language: Controlling Thinking Language Comes at the Cost of Accuracy
EMNLP 2025
MMDT: Decoding the Trustworthiness and Safety of Multimodal Foundation Models
ICLR 2025
GuardAgent: Safeguard LLM Agents via Knowledge-Enabled Reasoning
ICML 2025
RigorLLM: Resilient Guardrails for Large Language Models against Undesired Content
ICML 2024
BadChain: Backdoor Chain-of-Thought Prompting for Large Language Models
ICLR 2024
CBD: A Certified Backdoor Detector Based on Local Dominant Probability
NIPS 2023
UMD: Unsupervised Model Detection for X2X Backdoor Attacks
ICML 2023
DecodingTrust: A Comprehensive Assessment of Trustworthiness in GPT Models
NIPS 2023