Buck Shlegeris
4 papers · 2022–2025 · 3 conferences · across top CS/AI conferences
Achievements
Jump to papers ↓
🌍
Conference Polyglot
(3)
🌉
Interdisciplinary Bridge
🧭
Keyword Pioneer
🐣
Hot Topic Early Bird
🐝
Cross-Pollinator
(15)
Conferences
ICLR (2)
ICML (1)
NIPS (1)
Top co-authors
Papers
Adaptive Deployment of Untrusted LLMs Reduces Distributed Threats
ICLR 2025
AI Control: Improving Safety Despite Intentional Subversion
ICML 2024
Interpretability in the Wild: a Circuit for Indirect Object Identification in GPT-2 Small
ICLR 2023
Adversarial training for high-stakes reliability
NIPS 2022