Jacob Hilton
7 papers · 2020–2025 · 4 conferences · across top CS/AI conferences
Achievements
Interdisciplinary Bridge
Taxonomy Completionist (18)
Hot Topic Early Bird
Keyword Pioneer
Conference Polyglot (4)
Academic Marathon (5)
Cross-Pollinator (7)
Renaissance Researcher (5)
Mega-Team (20)
Trend Setter
Conferences
ICML (3)
NIPS (2)
ACL (1)
ICLR (1)
Keywords
reinforcement learning (3)
reinforcement learning from human feedback (2)
reward model (2)
sample efficiency (2)
model evaluation (1)
falsehood detection (1)
instruction following (1)
language model alignment (1)
model alignment (1)
value function (1)
off-policy learning (1)
human feedback (1)
language model (1)
synthetic data (1)
scaling law (1)
supervised fine-tuning (1)
proximal policy optimization (1)
on-policy learning (1)
actor-critic method (1)
feature distillation (1)
Papers
Estimating the Probabilities of Rare Outputs in Language Models
ICLR 2025
Scaling Laws for Reward Model Overoptimization
ICML 2023
Training language models to follow instructions with human feedback
NIPS 2022
TruthfulQA: Measuring How Models Mimic Human Falsehoods
ACL 2022
Batch size-invariance for policy optimization
NIPS 2022
Phasic Policy Gradient
ICML 2021
Leveraging Procedural Generation to Benchmark Reinforcement Learning
ICML 2020