Research Explorer
Papers
Trends
Conferences
Explore
Authors
Topics
Keywords
Papers
Trends
Conferences
Explore
Authors
Topics
Keywords
Achievements
About
Methodology
← Methods
Reinforcement Learning
›
Methods
›
Policy Learning
2068 directly classified papers
Papers per year
2002: 6
2003: 1
2004: 1
2006: 11
2007: 10
2008: 14
2009: 9
2010: 23
2011: 15
2012: 25
2013: 25
2014: 24
2015: 23
2016: 27
2017: 61
2018: 107
2019: 187
2020: 216
2021: 274
2022: 259
2023: 321
2024: 247
2025: 153
2026: 29
Papers
Exploration by Maximizing Renyi Entropy for Reward-Free RL Framework
AAAI 2021
Sample Efficient Reinforcement Learning with REINFORCE
AAAI 2021
Mean-Variance Policy Iteration for Risk-Averse Reinforcement Learning
AAAI 2021
Explicable Reward Design for Reinforcement Learning Agents
NIPS 2021
Policy Learning Using Weak Supervision
NIPS 2021
Bridging the Imitation Gap by Adaptive Insubordination
NIPS 2021
Autonomous Reinforcement Learning via Subgoal Curricula
NIPS 2021
Reinforcement learning for optimization of variational quantum circuit architectures
NIPS 2021
Learning Policies with Zero or Bounded Constraint Violation for Constrained MDPs
NIPS 2021
An Efficient Transfer Learning Framework for Multiagent Reinforcement Learning
NIPS 2021
Sample-Efficient Reinforcement Learning Is Feasible for Linearly Realizable MDPs with Limited Revisiting
NIPS 2021
Reinforcement Learning in Linear MDPs: Constant Regret and Representation Selection
NIPS 2021
The Logical Options Framework
ICML 2021
Robust Reinforcement Learning using Least Squares Policy Iteration with Provable Performance Guarantees
ICML 2021
Model-based Reinforcement Learning for Continuous Control with Posterior Sampling
ICML 2021
Diversity Actor-Critic: Sample-Aware Entropy Regularization for Sample-Efficient Exploration
ICML 2021
Generalizable Imitation Learning from Observation via Inferring Goal Proximity
NIPS 2021
The Sample Complexity of Teaching by Reinforcement on Q-Learning
AAAI 2021
Uniform-PAC Bounds for Reinforcement Learning with Linear Function Approximation
NIPS 2021
Provable Benefits of Actor-Critic Methods for Offline Reinforcement Learning
NIPS 2021
Settling the Variance of Multi-Agent Policy Gradients
NIPS 2021
Accommodating Picky Customers: Regret Bound and Exploration Complexity for Multi-Objective Reinforcement Learning
NIPS 2021
Towards Hyperparameter-free Policy Selection for Offline Reinforcement Learning
NIPS 2021
Augmenting Policy Learning with Routines Discovered from a Single Demonstration
AAAI 2021
Inverse Reinforcement Learning with Natural Language Goals
AAAI 2021
<
1
…
44
45
46
…
83
>