conftrace_

Bernardo Avila Pires

13 papers · 2013–2025 · 5 conferences · across top CS/AI conferences

Achievements

🏃 Academic Marathon (12) 🐝 Cross-Pollinator (8) 🗺️ Taxonomy Completionist (21) 🧭 Keyword Pioneer 🐣 Hot Topic Early Bird

🌉 Interdisciplinary Bridge 🌍 Conference Polyglot (5) 💎 Century Club (13) 📈 Trend Setter 🚀 Conference Pioneer

Conferences

ICML (7) NIPS (3) AISTATS (1) COLT (1) JMLR (1)

Top co-authors

Rémi Munos (9) Yunhao Tang (7) Mark Rowland (7) Zhaohan Daniel Guo (6) Michal Valko (6) Bilal Piot (6) Will Dabney (5) Mohammad Gheshlaghi Azar (4) Daniele Calandriello (4) Pierre Harvey Richemond (3)

Keywords

reinforcement learning (3) self-supervised learning (3) representation learning (3) deep reinforcement learning (3) multi-step learning (2) policy optimization (2) latent representation (2) off-policy learning (2) multiclass classification (1) markov decision process (1) distributional reinforcement learning (1) loss landscape (1) temporal difference (1) value iteration (1) contrastive learning (1) surrogate loss (1) policy gradient (1) model-based reinforcement learning (1) quantile regression (1) transfer learning (1)

Papers

Optimizing Return Distributions with Distributional Dynamic Programming JMLR 2025
A Unifying Framework for Action-Conditional Self-Predictive Reinforcement Learning AISTATS 2025
Human Alignment of Large Language Models through Online Preference Optimisation ICML 2024
Generalized Preference Optimization: A Unified Approach to Offline Alignment ICML 2024
Understanding Self-Predictive Learning for Reinforcement Learning ICML 2023
Understanding Plasticity in Neural Networks ICML 2023
DoMo-AC: Doubly Multi-step Off-policy Actor-Critic Algorithm ICML 2023
BYOL-Explore: Exploration by Bootstrapped Prediction NIPS 2022
The Nature of Temporal Difference Errors in Multi-step Distributional Reinforcement Learning NIPS 2022
Bootstrap Latent-Predictive Representations for Multitask Reinforcement Learning ICML 2020
Bootstrap Your Own Latent - A New Approach to Self-Supervised Learning NIPS 2020
Policy Error Bounds for Model-Based Reinforcement Learning with Factored Linear Models COLT 2016
Cost-sensitive Multiclass Classification Risk Bounds ICML 2013