Research Explorer
Reinforcement Learning › Methods › Policy Learning
2068 directly classified papers
Papers per year
2002: 6
2003: 1
2004: 1
2006: 11
2007: 10
2008: 14
2009: 9
2010: 23
2011: 15
2012: 25
2013: 25
2014: 24
2015: 23
2016: 27
2017: 61
2018: 107
2019: 187
2020: 216
2021: 274
2022: 259
2023: 321
2024: 247
2025: 153
2026: 29
Papers
Robust Asymmetric Learning in POMDPs
ICML 2021
Characterizing the Gap Between Actor-Critic and Policy Gradient
ICML 2021
CRPO: A New Approach for Safe Reinforcement Learning with Convergence Guarantee
ICML 2021
Accelerating Safe Reinforcement Learning with Constraint-mismatched Baseline Policies
ICML 2021
On-Policy Deep Reinforcement Learning for the Average-Reward Criterion
ICML 2021
MetaCURE: Meta Reinforcement Learning with Empowerment-Driven Exploration
ICML 2021
Provably Efficient Reinforcement Learning for Discounted MDPs with Feature Mapping
ICML 2021
"Good Robot! Now Watch This!": Repurposing Reinforcement Learning for Task-to-Task Transfer
CORL 2021
Redundancy Resolution as Action Bias in Policy Search for Robotic Manipulation
CORL 2021
A Constrained Multi-Objective Reinforcement Learning Framework
CORL 2021
Learning Feasibility to Imitate Demonstrators with Different Dynamics
CORL 2021
Nearly Minimax Optimal Reinforcement Learning for Linear Mixture Markov Decision Processes
COLT 2021
Is Reinforcement Learning More Difficult Than Bandits? A Near-optimal Algorithm Escaping the Curse of Horizon
COLT 2021
Cautiously Optimistic Policy Optimization and Exploration with Linear Function Approximation
COLT 2021
Commission Fee is not Enough: A Hierarchical Reinforced Framework for Portfolio Management
AAAI 2021
Learning to Recommend from Sparse Data via Generative User Feedback
AAAI 2021
Hierarchical Reinforcement Learning for Integrated Recommendation
AAAI 2021
Learning Rewards From Linguistic Feedback
AAAI 2021
Bounded Risk-Sensitive Markov Games: Forward Policy Design and Inverse Reward Learning with Iterative Reasoning and Cumulative Prospect Theory
AAAI 2021
Deep Radial-Basis Value Functions for Continuous Control
AAAI 2021
Relative Variational Intrinsic Control
AAAI 2021
Addressing Action Oscillations through Learning Policy Inertia
AAAI 2021
Learning to Reweight Imaginary Transitions for Model-Based Reinforcement Learning
AAAI 2021
Variance Penalized On-Policy and Off-Policy Actor-Critic
AAAI 2021
Temporal-Logic-Based Reward Shaping for Continuing Reinforcement Learning Tasks
AAAI 2021