conftrace_

Thomas Mesnard

6 papers · 2019–2024 · 2 conferences · across top CS/AI conferences

Achievements

Jump to papers ↓

+2 more ↓

🌍 Conference Polyglot (2) 🏃 Academic Marathon (5) 🌉 Interdisciplinary Bridge 🗺️ Taxonomy Completionist (10) 🧭 Keyword Pioneer

🐣 Hot Topic Early Bird 🐝 Cross-Pollinator (15)

Conferences

ICML (5) NIPS (1)

Top co-authors

Rémi Munos (5) Will Dabney (3) Michal Valko (3) Eric Moulines (2) Yunhao Tang (2) Nicolas Heess (2) Doina Precup (2) Alaa Saade (2) Anna Harutyunyan (2) Mark Rowland (2)

Keywords

credit assignment (3) variance reduction (2) policy gradient (2) value function (2) value estimation (1) curiosity-driven learning (1) intrinsic motivation (1) counterfactual reasoning (1) model-free reinforcement learning (1) stochastic environment (1) intrinsic exploration (1) curiosity-driven exploration (1) representation learning (1) distributional value estimation (1) reinforcement learning (1) temporal difference learning (1) distributional reinforcement learning (1)

Papers

RLAIF vs. RLHF: Scaling Reinforcement Learning from Human Feedback with AI Feedback ICML 2024 Nash Learning from Human Feedback ICML 2024 Curiosity in Hindsight: Intrinsic Exploration in Stochastic Environments ICML 2023 Quantile Credit Assignment ICML 2023 Counterfactual Credit Assignment in Model-Free Reinforcement Learning ICML 2021 Hindsight Credit Assignment NIPS 2019