Tom Joy
5 papers · 2021–2024 · 4 conferences · across top CS/AI conferences
Achievements
Jump to papers ↓+2 more ↓ Show less ↑
π Conference Polyglot (4) π Interdisciplinary Bridge π§ Keyword Pioneer πΊοΈ Taxonomy Completionist (15) π Cross-Pollinator (4)
π
Renaissance Researcher
(5)
β
The Questioner
Conferences
ICLR (2)
AAAI (1)
CVPR (1)
NIPS (1)
Top co-authors
Keywords
adversarial learning
(1)
uncertainty quantification
(1)
object detection
(1)
direct preference optimization
(1)
preference alignment
(1)
preference optimization
(1)
autonomous driving
(1)
confidence calibration
(1)
out-of-distribution detection
(1)
mechanistic interpretability
(1)
safety fine-tuning
(1)
domain shift
(1)
temperature scaling
(1)
adversarial input
(1)
mlp weight transformation
(1)
jailbreak defense
(1)
expected calibration error
(1)
neural network calibration
(1)
large language model
(1)
neural network
(1)
Papers
What Makes and Breaks Safety Fine-tuning? A Mechanistic Study
NIPS 2024
Sample-Dependent Adaptive Temperature Scaling for Improved Calibration
AAAI 2023
Towards Building Self-Aware Object Detectors via Reliable Uncertainty Quantification and Calibration
CVPR 2023
Learning Multimodal VAEs through Mutual Supervision
ICLR 2022
Capturing Label Characteristics in VAEs
ICLR 2021