Research Explorer
Papers
Trends
Conferences
Explore
Authors
Topics
Keywords
Papers
Trends
Conferences
Explore
Authors
Topics
Keywords
Achievements
About
Methodology
← Methods
Reinforcement Learning
›
Methods
›
Policy Learning
2068 directly classified papers
Papers per year
2002: 6
2003: 1
2004: 1
2006: 11
2007: 10
2008: 14
2009: 9
2010: 23
2011: 15
2012: 25
2013: 25
2014: 24
2015: 23
2016: 27
2017: 61
2018: 107
2019: 187
2020: 216
2021: 274
2022: 259
2023: 321
2024: 247
2025: 153
2026: 29
Papers
Near-Optimal Goal-Oriented Reinforcement Learning in Non-Stationary Environments
NIPS 2022
PALMER: Perception - Action Loop with Memory for Long-Horizon Planning
NIPS 2022
DNA: Proximal Policy Optimization with a Dual Network Architecture
NIPS 2022
Lyapunov Design for Robust and Efficient Robotic Reinforcement Learning
CORL 2022
Global Convergence of Direct Policy Search for State-Feedback $\mathcal{H}_\infty$ Robust Control: A Revisit of Nonsmooth Synthesis with Goldstein Subdifferential
NIPS 2022
Regret Bounds for Risk-Sensitive Reinforcement Learning
NIPS 2022
Reinforcement Learning Explainability via Model Transforms (Student Abstract)
AAAI 2022
How to Reduce Action Space for Planning Domains? (Student Abstract)
AAAI 2022
Adaptive Pairwise Weights for Temporal Credit Assignment
AAAI 2022
Robust Action Gap Increasing with Clipped Advantage Learning
AAAI 2022
MDPGT: Momentum-Based Decentralized Policy Gradient Tracking
AAAI 2022
Goal Recognition as Reinforcement Learning
AAAI 2022
Model-Based Offline Planning with Trajectory Pruning
IJCAI 2022
Offline-to-Online Co-Evolutional User Simulator and Dialogue System
EMNLP 2022
Efficient Risk-Averse Reinforcement Learning
NIPS 2022
Episodic Policy Gradient Training
AAAI 2022
Same State, Different Task: Continual Reinforcement Learning without Interference
AAAI 2022
Lifelong Hyper-Policy Optimization with Multiple Importance Sampling Regularization
AAAI 2022
Theoretical Guarantees of Fictitious Discount Algorithms for Episodic Reinforcement Learning and Global Convergence of Policy Gradient Methods
AAAI 2022
Constraint Sampling Reinforcement Learning: Incorporating Expertise for Faster Learning
AAAI 2022
Unsupervised Reinforcement Learning in Multiple Environments
AAAI 2022
Reinforcement Learning with Stochastic Reward Machines
AAAI 2022
Distillation of RL Policies with Formal Guarantees via Variational Abstraction of Markov Decision Processes
AAAI 2022
Context-Specific Representation Abstraction for Deep Option Learning
AAAI 2022
Admissible Policy Teaching through Reward Design
AAAI 2022
<
1
…
34
35
36
…
83
>