Alexander Robey
14 papers · 2019–2025 · 7 top CS/AI conferences
Achievements
🌍 Conference Polyglot (7)
🏃 Academic Marathon (6)
🌉 Interdisciplinary Bridge
🧭 Keyword Pioneer
🐝 Cross-Pollinator (12)
🌈 Renaissance Researcher (7)
🤝 Dynamic Duo (11)
👑 Triple Crown
👥 Mega-Team (23)
💎 Century Club (14)
❓ The Questioner
🗃️ Keyword Collector (62)
Conferences
NIPS (5)
ICLR (2)
ICML (2)
L4DC (2)
AACL (1)
CORL (1)
IJCNLP (1)
Top co-authors
Research topics
Keywords
adversarial robustness (3)
jailbreak attack (3)
adversarial training (3)
large language model (2)
sample complexity (2)
domain generalization (2)
semantic smoothing (2)
policy optimization (1)
imitation learning (1)
adversarial learning (1)
distribution shift (1)
language model alignment (1)
distributed optimization (1)
ai safety (1)
convex optimization (1)
continuous control (1)
stability constraint (1)
semidefinite programming (1)
message passing (1)
constrained optimization (1)
Papers
Defending Large Language Models against Jailbreak Attacks via Semantic Smoothing
AACL 2025
Defending Large Language Models against Jailbreak Attacks via Semantic Smoothing
IJCNLP 2025
Adversarial Training Should Be Cast as a Non-Zero-Sum Game
ICLR 2024
Position: A Safe Harbor for AI Evaluation and Red Teaming
ICML 2024
JailbreakBench: An Open Robustness Benchmark for Jailbreaking Large Language Models
NIPS 2024
On the Sample Complexity of Stability Constrained Imitation Learning
L4DC 2022
Probable Domain Generalization via Quantile Risk Minimization
NIPS 2022
Do deep networks transfer invariances across classes?
ICLR 2022
Probabilistically Robust Learning: Balancing Average and Worst-case Performance
ICML 2022
Model-Based Domain Generalization
NIPS 2021
Adversarial Robustness with Semi-Infinite Constrained Learning
NIPS 2021
Optimal Algorithms for Submodular Maximization with Distributed Constraints
L4DC 2021
Learning Hybrid Control Barrier Functions from Data
CORL 2020
Efficient and Accurate Estimation of Lipschitz Constants for Deep Neural Networks
NIPS 2019