Bernardo Avila Pires
13 papers · 2013–2025 · 5 conferences · across top CS/AI conferences
Achievements
Academic Marathon (12)
Cross-Pollinator (8)
Taxonomy Completionist (21)
Keyword Pioneer
Hot Topic Early Bird
Interdisciplinary Bridge
Conference Polyglot (5)
Century Club (13)
Trend Setter
Conference Pioneer
Conferences
ICML (7)
NIPS (3)
AISTATS (1)
COLT (1)
JMLR (1)
Keywords
reinforcement learning (3)
self-supervised learning (3)
representation learning (3)
deep reinforcement learning (3)
multi-step learning (2)
policy optimization (2)
latent representation (2)
off-policy learning (2)
multiclass classification (1)
markov decision process (1)
distributional reinforcement learning (1)
loss landscape (1)
temporal difference (1)
value iteration (1)
contrastive learning (1)
surrogate loss (1)
policy gradient (1)
model-based reinforcement learning (1)
quantile regression (1)
transfer learning (1)
Papers
Optimizing Return Distributions with Distributional Dynamic Programming
JMLR 2025
A Unifying Framework for Action-Conditional Self-Predictive Reinforcement Learning
AISTATS 2025
Human Alignment of Large Language Models through Online Preference Optimisation
ICML 2024
Generalized Preference Optimization: A Unified Approach to Offline Alignment
ICML 2024
Understanding Self-Predictive Learning for Reinforcement Learning
ICML 2023
Understanding Plasticity in Neural Networks
ICML 2023
DoMo-AC: Doubly Multi-step Off-policy Actor-Critic Algorithm
ICML 2023
BYOL-Explore: Exploration by Bootstrapped Prediction
NIPS 2022
The Nature of Temporal Difference Errors in Multi-step Distributional Reinforcement Learning
NIPS 2022
Bootstrap Latent-Predictive Representations for Multitask Reinforcement Learning
ICML 2020
Bootstrap Your Own Latent - A New Approach to Self-Supervised Learning
NIPS 2020
Policy Error Bounds for Model-Based Reinforcement Learning with Factored Linear Models
COLT 2016
Cost-sensitive Multiclass Classification Risk Bounds
ICML 2013