conftrace_

Tom Joy

5 papers · 2021–2024 · 4 conferences · across top CS/AI conferences

Achievements

🌍 Conference Polyglot (4) 🌉 Interdisciplinary Bridge 🧭 Keyword Pioneer 🗺️ Taxonomy Completionist (15) 🐝 Cross-Pollinator (4)

🌈 Renaissance Researcher (5) ❓ The Questioner

Conferences

ICLR (2) AAAI (1) CVPR (1) NIPS (1)

Top co-authors

Puneet K. Dokania (3) Philip H.S. Torr (2) Kemal Oksuz (2) Siddharth N (2) Tom Rainforth (2) Philip Torr (2) Yuge Shi (1) Ser-Nam Lim (1) Sebastian M Schmon (1) Ekdeep Singh Lubana (1)

Keywords

adversarial learning (1) uncertainty quantification (1) object detection (1) direct preference optimization (1) preference alignment (1) preference optimization (1) autonomous driving (1) confidence calibration (1) out-of-distribution detection (1) mechanistic interpretability (1) safety fine-tuning (1) domain shift (1) temperature scaling (1) adversarial input (1) mlp weight transformation (1) jailbreak defense (1) expected calibration error (1) neural network calibration (1) large language model (1) neural network (1)

Papers

What Makes and Breaks Safety Fine-tuning? A Mechanistic Study · NIPS 2024
Sample-Dependent Adaptive Temperature Scaling for Improved Calibration · AAAI 2023
Towards Building Self-Aware Object Detectors via Reliable Uncertainty Quantification and Calibration · CVPR 2023
Learning Multimodal VAEs through Mutual Supervision · ICLR 2022
Capturing Label Characteristics in VAEs · ICLR 2021