Thomas Mesnard
6 papers · 2019–2024 · 2 conferences · across top CS/AI conferences
Achievements
Conference Polyglot (2)
Academic Marathon (5)
Interdisciplinary Bridge
Taxonomy Completionist (10)
Keyword Pioneer
Hot Topic Early Bird
Cross-Pollinator (15)
Conferences
ICML (5)
NIPS (1)
Keywords
credit assignment (3)
variance reduction (2)
policy gradient (2)
value function (2)
value estimation (1)
curiosity-driven learning (1)
intrinsic motivation (1)
counterfactual reasoning (1)
model-free reinforcement learning (1)
stochastic environment (1)
intrinsic exploration (1)
curiosity-driven exploration (1)
representation learning (1)
distributional value estimation (1)
reinforcement learning (1)
temporal difference learning (1)
distributional reinforcement learning (1)
Papers
RLAIF vs. RLHF: Scaling Reinforcement Learning from Human Feedback with AI Feedback
ICML 2024
Nash Learning from Human Feedback
ICML 2024
Curiosity in Hindsight: Intrinsic Exploration in Stochastic Environments
ICML 2023
Quantile Credit Assignment
ICML 2023
Counterfactual Credit Assignment in Model-Free Reinforcement Learning
ICML 2021
Hindsight Credit Assignment
NIPS 2019