Research Explorer
Papers
Trends
Conferences
Explore
Authors
Topics
Keywords
Papers
Trends
Conferences
Explore
Authors
Topics
Keywords
Achievements
About
Methodology
← Learning Types
Machine Learning
›
Learning Types
›
Reinforcement Learning
2932 directly classified papers
Papers per year
2003: 1
2006: 11
2007: 18
2008: 23
2009: 14
2010: 22
2011: 24
2012: 34
2013: 26
2014: 24
2015: 14
2016: 23
2017: 79
2018: 182
2019: 255
2020: 284
2021: 333
2022: 319
2023: 315
2024: 457
2025: 419
2026: 55
Papers
On the Power of Pre-training for Generalization in RL: Provable Benefits and Hardness
ICML 2023
Q-learning Decision Transformer: Leveraging Dynamic Programming for Conditional Sequence Modelling in Offline RL
ICML 2023
An Instrumental Variable Approach to Confounded Off-Policy Evaluation
ICML 2023
Regret Bounds for Markov Decision Processes with Recursive Optimized Certainty Equivalents
ICML 2023
Universal Morphology Control via Contextual Modulation
ICML 2023
Controlling Type Confounding in Ad Hoc Teamwork with Instance-wise Teammate Feedback Rectification
ICML 2023
Semiparametrically Efficient Off-Policy Evaluation in Linear Markov Decision Processes
ICML 2023
Future-conditioned Unsupervised Pretraining for Decision Transformer
ICML 2023
Differentially Private Episodic Reinforcement Learning with Heavy-tailed Rewards
ICML 2023
Distributional Offline Policy Evaluation with Predictive Error Guarantees
ICML 2023
The Blessing of Heterogeneity in Federated Q-Learning: Linear Speedup and Beyond
ICML 2023
Warm-Start Actor-Critic: From Approximation Error to Sub-optimality Gap
ICML 2023
SeMAIL: Eliminating Distractors in Visual Imitation via Separated Models
ICML 2023
Multi-Environment Pretraining Enables Transfer to Action Limited Datasets
ICML 2023
The Virtues of Laziness in Model-based RL: A Unified Objective and Algorithms
ICML 2023
Computationally Efficient PAC RL in POMDPs with Latent Determinism and Conditional Embeddings
ICML 2023
Fast Rates for Maximum Entropy Exploration
ICML 2023
Reinforcement Learning with History Dependent Dynamic Contexts
ICML 2023
VA-learning as a more efficient alternative to Q-learning
ICML 2023
Towards a better understanding of representation dynamics under TD-learning
ICML 2023
DoMo-AC: Doubly Multi-step Off-policy Actor-Critic Algorithm
ICML 2023
Tile Networks: Learning Optimal Geometric Layout for Whole-page Recommendation
AISTATS 2022
Sample Complexity of Robust Reinforcement Learning with a Generative Model
AISTATS 2022
Finite Sample Analysis of Mean-Volatility Actor-Critic for Risk-Averse Reinforcement Learning
AISTATS 2022
Towards an Understanding of Default Policies in Multitask Policy Optimization
AISTATS 2022
<
1
…
49
50
51
…
118
>