Research Explorer
Papers
Trends
Conferences
Explore
Authors
Topics
Keywords
Papers
Trends
Conferences
Explore
Authors
Topics
Keywords
Achievements
About
Methodology
← Methods
Reinforcement Learning
›
Methods
›
Policy Learning
2068 directly classified papers
Papers per year
2002: 6
2003: 1
2004: 1
2006: 11
2007: 10
2008: 14
2009: 9
2010: 23
2011: 15
2012: 25
2013: 25
2014: 24
2015: 23
2016: 27
2017: 61
2018: 107
2019: 187
2020: 216
2021: 274
2022: 259
2023: 321
2024: 247
2025: 153
2026: 29
Papers
Lexicographic Multi-Objective Reinforcement Learning
IJCAI 2022
Training Robots to Evaluate Robots: Example-Based Interactive Reward Functions for Policy Learning
CORL 2022
Deep Whole-Body Control: Learning a Unified Policy for Manipulation and Locomotion
CORL 2022
On the (In)Tractability of Reinforcement Learning for LTL Objectives
IJCAI 2022
TarGF: Learning Target Gradient Field to Rearrange Objects without Explicit Goal Specification
NIPS 2022
Inferring Smooth Control: Monte Carlo Posterior Policy Iteration with Gaussian Processes
CORL 2022
Towards Applicable Reinforcement Learning: Improving the Generalization and Sample Efficiency with Policy Ensemble
IJCAI 2022
Efficient Risk-Averse Reinforcement Learning
NIPS 2022
Global Convergence of Direct Policy Search for State-Feedback $\mathcal{H}_\infty$ Robust Control: A Revisit of Nonsmooth Synthesis with Goldstein Subdifferential
NIPS 2022
Model-Based Offline Planning with Trajectory Pruning
IJCAI 2022
Saute RL: Almost Surely Safe Reinforcement Learning Using State Augmentation
ICML 2022
Divergence-Regularized Multi-Agent Actor-Critic
ICML 2022
Causal Discovery and Reinforcement Learning: A Synergistic Integration
PGM 2022
Do Differentiable Simulators Give Better Policy Gradients?
ICML 2022
Anchor-Changing Regularized Natural Policy Gradient for Multi-Objective Reinforcement Learning
NIPS 2022
Robust On-Policy Sampling for Data-Efficient Policy Evaluation in Reinforcement Learning
NIPS 2022
Data augmentation for efficient learning from parametric experts
NIPS 2022
Near-Optimal Goal-Oriented Reinforcement Learning in Non-Stationary Environments
NIPS 2022
Myriad: a real-world testbed to bridge trajectory optimization and deep learning
NIPS 2022
Robust Imitation via Mirror Descent Inverse Reinforcement Learning
NIPS 2022
Exponential Family Model-Based Reinforcement Learning via Score Matching
NIPS 2022
Robust Anytime Learning of Markov Decision Processes
NIPS 2022
Learning to Grasp the Ungraspable with Emergent Extrinsic Dexterity
CORL 2022
A Few Expert Queries Suffices for Sample-Efficient RL with Resets and Linear Value Approximation
NIPS 2022
Distributional Reinforcement Learning for Risk-Sensitive Policies
NIPS 2022
<
1
…
39
40
41
…
83
>