Justin Svegliato
6 papers · 2018–2025 · 6 conferences · across top CS/AI conferences
Achievements
Jump to papers ↓+2 more ↓ Show less ↑
π Conference Polyglot (6) π§ Keyword Pioneer πΊοΈ Taxonomy Completionist (11) π Interdisciplinary Bridge π Academic Marathon (7)
π
Cross-Pollinator
(7)
π
Grand Slam
Conferences
AAAI (1)
ICLR (1)
ICML (1)
IJCAI (1)
NAACL (1)
NIPS (1)
Top co-authors
Keywords
online learning
(1)
model calibration
(1)
sequential decision
(1)
online prediction
(1)
computation time
(1)
anytime algorithm
(1)
language model
(1)
tool use
(1)
safety fine-tuning
(1)
autonomous system
(1)
confidence estimation
(1)
performance prediction
(1)
meta-level control
(1)
attack success rate
(1)
logit len
(1)
ethical framework
(1)
algorithm control
(1)
harmfulness evaluation
(1)
virtue ethics
(1)
divine command theory
(1)
Papers
AssistanceZero: Scalably Solving Assistance Games
ICML 2025
MICE for CATs: Model-Internal Confidence Estimation for Calibrating Agents with Tools
NAACL 2025
A StrongREJECT for Empty Jailbreaks
NIPS 2024
Tensor Trust: Interpretable Prompt Injection Attacks from an Online Game
ICLR 2024
Ethically Compliant Sequential Decision Making
AAAI 2021
Meta-Level Control of Anytime Algorithms with Online Performance Prediction
IJCAI 2018