Research Explorer
Reinforcement Learning › Methods › Policy Learning
2068 directly classified papers
Papers per year
2002: 6
2003: 1
2004: 1
2006: 11
2007: 10
2008: 14
2009: 9
2010: 23
2011: 15
2012: 25
2013: 25
2014: 24
2015: 23
2016: 27
2017: 61
2018: 107
2019: 187
2020: 216
2021: 274
2022: 259
2023: 321
2024: 247
2025: 153
2026: 29
Papers
Adaptive Order Q-learning
IJCAI 2024
POCE: Primal Policy Optimization with Conservative Estimation for Multi-constraint Offline Reinforcement Learning
CVPR 2024
Navigating Uncertainty in Epidemic Contexts with Reinforcement Learning
AAAI 2024
Online Reinforcement Learning-Based Pedagogical Planning for Narrative-Centered Learning Environments
AAAI 2024
On the Occupancy Measure of Non-Markovian Policies in Continuous MDPs
ICML 2023
Beyond Reward: Offline Preference-guided Policy Optimization
ICML 2023
Policy Regularization with Dataset Constraint for Offline Reinforcement Learning
ICML 2023
Learning Compiler Pass Orders using Coreset and Normalized Value Prediction
ICML 2023
Inverse Reinforcement Learning without Reinforcement Learning
ICML 2023
Nearly Minimax Optimal Reinforcement Learning for Linear Markov Decision Processes
ICML 2023
Posterior Sampling for Deep Reinforcement Learning
ICML 2023
Efficient Exploration via Epistemic-Risk-Seeking Policy Optimization
ICML 2023
Actor-Critic Alignment for Offline-to-Online Reinforcement Learning
ICML 2023
Better Training of GFlowNets with Local Credit and Incomplete Trajectories
ICML 2023
Revisiting Bellman Errors for Offline Model Selection
ICML 2023
ReLOAD: Reinforcement Learning with Optimistic Ascent-Descent for Last-Iterate Convergence in Constrained MDPs
ICML 2023
Model-based Reinforcement Learning with Scalable Composite Policy Gradient Estimators
ICML 2023
Towards Theoretical Understanding of Inverse Reinforcement Learning
ICML 2023
Quantum Policy Gradient Algorithm with Optimized Action Decoding
ICML 2023
MANSA: Learning Fast and Slow in Multi-Agent Systems
ICML 2023
Double Duality: Variational Primal-Dual Policy Optimization for Constrained Reinforcement Learning
JMLR 2023
LESSON: Learning to Integrate Exploration Strategies for Reinforcement Learning via an Option Framework
ICML 2023
Offline Reinforcement Learning with Closed-Form Policy Improvement Operators
ICML 2023
Internally Rewarded Reinforcement Learning
ICML 2023
Convex Reinforcement Learning in Finite Trials
JMLR 2023