conftrace_

Justin Svegliato

6 papers · 2018–2025 · 6 conferences · across top CS/AI conferences

Achievements

Jump to papers ↓

+2 more ↓

🌍 Conference Polyglot (6) 🧭 Keyword Pioneer 🗺️ Taxonomy Completionist (11) 🌉 Interdisciplinary Bridge 🏃 Academic Marathon (7)

🐝 Cross-Pollinator (7) 🏆 Grand Slam

Conferences

AAAI (1) ICLR (1) ICML (1) IJCAI (1) NAACL (1) NIPS (1)

Top co-authors

Sam Toyer (2) Pieter Abbeel (2) Stuart Russell (2) Shlomo Zilberstein (2) Olivia Watkins (2) Kyle Hollins Wray (1) Jason Eisner (1) Isaac Ong (1) Luke Bailey (1) Alexandra Souly (1)

Keywords

online learning (1) model calibration (1) sequential decision (1) online prediction (1) computation time (1) anytime algorithm (1) language model (1) tool use (1) safety fine-tuning (1) autonomous system (1) confidence estimation (1) performance prediction (1) meta-level control (1) attack success rate (1) logit len (1) ethical framework (1) algorithm control (1) harmfulness evaluation (1) virtue ethics (1) divine command theory (1)

Papers

AssistanceZero: Scalably Solving Assistance Games ICML 2025 MICE for CATs: Model-Internal Confidence Estimation for Calibrating Agents with Tools NAACL 2025 A StrongREJECT for Empty Jailbreaks NIPS 2024 Tensor Trust: Interpretable Prompt Injection Attacks from an Online Game ICLR 2024 Ethically Compliant Sequential Decision Making AAAI 2021 Meta-Level Control of Anytime Algorithms with Online Performance Prediction IJCAI 2018