Research Explorer
Policy Learning
2068 directly classified papers
Papers per year
2002: 6
2003: 1
2004: 1
2006: 11
2007: 10
2008: 14
2009: 9
2010: 23
2011: 15
2012: 25
2013: 25
2014: 24
2015: 23
2016: 27
2017: 61
2018: 107
2019: 187
2020: 216
2021: 274
2022: 259
2023: 321
2024: 247
2025: 153
2026: 29
Papers
Policy Regularization with Dataset Constraint for Offline Reinforcement Learning
ICML 2023
Reparameterized Policy Learning for Multimodal Trajectory Optimization
ICML 2023
Posterior Sampling for Deep Reinforcement Learning
ICML 2023
Adaptive Reward Shifting Based on Behavior Proximity for Offline Reinforcement Learning
IJCAI 2023
Zero-Shot Linear Combinations of Grounded Social Interactions with Linear Social MDPs
AAAI 2023
Adaptive Barrier Smoothing for First-Order Policy Gradient with Contact Dynamics
ICML 2023
Boosting Offline Reinforcement Learning with Action Preference Query
ICML 2023
On the Global Convergence of Risk-Averse Policy Gradient Methods with Expected Conditional Risk Measures
ICML 2023
Best of Both Worlds Policy Optimization
ICML 2023
Warm-Start Actor-Critic: From Approximation Error to Sub-optimality Gap
ICML 2023
Actor-Critic Alignment for Offline-to-Online Reinforcement Learning
ICML 2023
STEER: Unified Style Transfer with Expert Reinforcement
EMNLP 2023
The Power of Learned Locally Linear Models for Nonlinear Policy Optimization
ICML 2023
VA-learning as a more efficient alternative to Q-learning
ICML 2023
Hierarchical Imitation Learning with Vector Quantized Models
ICML 2023
Reinforcement Learning with History Dependent Dynamic Contexts
ICML 2023
Nearly Minimax Optimal Regret for Learning Linear Mixture Stochastic Shortest Path
ICML 2023
Does Sparsity Help in Learning Misspecified Linear Bandits?
ICML 2023
Transferable Curricula through Difficulty Conditioned Generators
IJCAI 2023
Spotlight News Driven Quantitative Trading Based on Trajectory Optimization
IJCAI 2023
Q-learning Decision Transformer: Leveraging Dynamic Programming for Conditional Sequence Modelling in Offline RL
ICML 2023
A Connection between One-Step RL and Critic Regularization in Reinforcement Learning
ICML 2023
Nearly Minimax Optimal Reinforcement Learning for Linear Markov Decision Processes
ICML 2023
Refined Regret for Adversarial MDPs with Linear Function Approximation
ICML 2023
Trajectory-Aware Eligibility Traces for Off-Policy Reinforcement Learning
ICML 2023