Aleksandar Makelov
4 papers · 2018–2025 · 2 conferences · across top CS/AI conferences
Achievements
Jump to papers ↓+1 more ↓ Show less ↑
🌍 Conference Polyglot (2) 🏃 Academic Marathon (7) 🌉 Interdisciplinary Bridge 🐝 Cross-Pollinator (15) ❓ The Questioner
🚀
Conference Pioneer
Conferences
ICLR (3)
ICML (1)
Top co-authors
Research topics
Papers
Towards Principled Evaluations of Sparse Autoencoders for Interpretability and Control
ICLR 2025
Is This the Subspace You Are Looking for? An Interpretability Illusion for Subspace Activation Patching
ICLR 2024
Rethinking Backdoor Attacks
ICML 2023
Towards Deep Learning Models Resistant to Adversarial Attacks
ICLR 2018