Research Explorer
Papers
Trends
Conferences
Explore
Authors
Topics
Keywords
Papers
Trends
Conferences
Explore
Authors
Topics
Keywords
Achievements
About
Methodology
← Methods
Reinforcement Learning
›
Methods
›
Policy Learning
2068 directly classified papers
Papers per year
2002: 6
2003: 1
2004: 1
2006: 11
2007: 10
2008: 14
2009: 9
2010: 23
2011: 15
2012: 25
2013: 25
2014: 24
2015: 23
2016: 27
2017: 61
2018: 107
2019: 187
2020: 216
2021: 274
2022: 259
2023: 321
2024: 247
2025: 153
2026: 29
Papers
Beyond Reward: Offline Preference-guided Policy Optimization
ICML 2023
MANSA: Learning Fast and Slow in Multi-Agent Systems
ICML 2023
Quantum Policy Gradient Algorithm with Optimized Action Decoding
ICML 2023
Towards Theoretical Understanding of Inverse Reinforcement Learning
ICML 2023
Quantile Credit Assignment
ICML 2023
Revisiting Bellman Errors for Offline Model Selection
ICML 2023
Performative Reinforcement Learning
ICML 2023
Active Policy Improvement from Multiple Black-box Oracles
ICML 2023
Hierarchical Programmatic Reinforcement Learning via Learning to Compose Programs
ICML 2023
Emergent Agentic Transformer from Chain of Hindsight Experience
ICML 2023
Internally Rewarded Reinforcement Learning
ICML 2023
Offline Reinforcement Learning with Closed-Form Policy Improvement Operators
ICML 2023
Policy Gradient Play with Networked Agents in Markov Potential Games
L4DC 2023
Understanding the Complexity Gains of Single-Task RL with a Curriculum
ICML 2023
Low-Switching Policy Gradient with Exploration via Online Sensitivity Sampling
ICML 2023
Efficient Exploration via Epistemic-Risk-Seeking Policy Optimization
ICML 2023
Better Training of GFlowNets with Local Credit and Incomplete Trajectories
ICML 2023
Near-optimal Conservative Exploration in Reinforcement Learning under Episode-wise Constraints
ICML 2023
Delay-Adapted Policy Optimization and Improved Regret for Adversarial MDP with Delayed Bandit Feedback
ICML 2023
Reward-Mixing MDPs with Few Latent Contexts are Learnable
ICML 2023
Hierarchical Imitation Learning with Vector Quantized Models
ICML 2023
Variational Curriculum Reinforcement Learning for Unsupervised Discovery of Skills
ICML 2023
LESSON: Learning to Integrate Exploration Strategies for Reinforcement Learning via an Option Framework
ICML 2023
Double Duality: Variational Primal-Dual Policy Optimization for Constrained Reinforcement Learning
JMLR 2023
Demonstration-free Autonomous Reinforcement Learning via Implicit and Bidirectional Curriculum
ICML 2023
<
1
…
27
28
29
…
83
>