conftrace_

Jacob Hilton

7 papers · 2020–2025 · 4 conferences · across top CS/AI conferences

Achievements

Jump to papers ↓

+5 more ↓

🌉 Interdisciplinary Bridge 🗺️ Taxonomy Completionist (18) 🐣 Hot Topic Early Bird 🧭 Keyword Pioneer 🌍 Conference Polyglot (4)

🏃 Academic Marathon (5) 🐝 Cross-Pollinator (7) 🌈 Renaissance Researcher (5) 👥 Mega-Team (20) 📈 Trend Setter

Conferences

ICML (3) NIPS (2) ACL (1) ICLR (1)

Top co-authors

John Schulman (5) Karl Cobbe (2) Ryan Lowe (1) Karl W Cobbe (1) Maddie Simens (1) Leo Gao (1) Paul F Christiano (1) Diogo Almeida (1) Long Ouyang (1) Jeffrey Wu (1)

Keywords

reinforcement learning (3) reinforcement learning from human feedback (2) reward model (2) sample efficiency (2) model evaluation (1) falsehood detection (1) instruction following (1) language model alignment (1) model alignment (1) value function (1) off-policy learning (1) human feedback (1) language model (1) synthetic datum (1) scaling law (1) supervised fine-tuning (1) proximal policy optimization (1) on-policy learning (1) actor-critic method (1) feature distillation (1)

Papers

Estimating the Probabilities of Rare Outputs in Language Models ICLR 2025 Scaling Laws for Reward Model Overoptimization ICML 2023 Training language models to follow instructions with human feedback NIPS 2022 TruthfulQA: Measuring How Models Mimic Human Falsehoods ACL 2022 Batch size-invariance for policy optimization NIPS 2022 Phasic Policy Gradient ICML 2021 Leveraging Procedural Generation to Benchmark Reinforcement Learning ICML 2020