conftrace
_
Papers
Trends
Conferences
Explore
Authors
Papers
Trends
Conferences
Explore
Authors
Topics
Keywords
Achievements
← Learning Types
Machine Learning
›
Learning Types
›
Reinforcement Learning
2,932 papers
Papers per year
2003: 1
2006: 11
2007: 18
2008: 23
2009: 14
2010: 22
2011: 24
2012: 34
2013: 26
2014: 24
2015: 14
2016: 23
2017: 79
2018: 182
2019: 255
2020: 284
2021: 333
2022: 319
2023: 315
2024: 457
2025: 419
2026: 55
Papers
Model Selection in Batch Policy Optimization
ICML 2022
Deconfounded Value Decomposition for Multi-Agent Reinforcement Learning
ICML 2022
Reducing Variance in Temporal-Difference Value Estimation via Ensemble of Deep Networks
ICML 2022
Zero-Shot Reward Specification via Grounded Natural Language
ICML 2022
Learning Stochastic Shortest Path with Linear Function Approximation
ICML 2022
A Simple Reward-free Approach to Constrained Reinforcement Learning
ICML 2022
The Importance of Non-Markovianity in Maximum State Entropy Exploration
ICML 2022
Improved Regret for Differentially Private Exploration in Linear MDP
ICML 2022
Recurrent Model-Free RL Can Be a Strong Baseline for Many POMDPs
ICML 2022
Optimal Estimation of Policy Gradient via Double Fitted Iteration
ICML 2022
The Primacy Bias in Deep Reinforcement Learning
ICML 2022
Constrained Offline Policy Optimization
ICML 2022
Direct Behavior Specification via Constrained Reinforcement Learning
ICML 2022
Off-Policy Evaluation for Large Action Spaces via Embeddings
ICML 2022
Pessimistic Q-Learning for Offline Reinforcement Learning: Towards Optimal Sample Complexity
ICML 2022
A Minimax Learning Approach to Off-Policy Evaluation in Confounded Partially Observable Markov Decision Processes
ICML 2022
Influence-Augmented Local Simulators: a Scalable Solution for Fast Deep RL in Large Networked Systems
ICML 2022
Generalised Policy Improvement with Geometric Policy Composition
ICML 2022
From Dirichlet to Rubin: Optimistic Exploration in RL without Bonuses
ICML 2022
A Temporal-Difference Approach to Policy Gradient Estimation
ICML 2022
Interpretable Off-Policy Learning via Hyperbox Search
ICML 2022
First-Order Regret in Reinforcement Learning with Linear Function Approximation: A Robust Estimation Approach
ICML 2022
Reward-Free RL is No Harder Than Reward-Aware RL in Linear Markov Decision Processes
ICML 2022
Safe Exploration for Efficient Policy Evaluation and Comparison
ICML 2022
Towards Evaluating Adaptivity of Model-Based Reinforcement Learning Methods
ICML 2022
<
1
…
60
61
62
…
118
>