Research Explorer
Papers
Trends
Conferences
Explore
Authors
Topics
Keywords
Papers
Trends
Conferences
Explore
Authors
Topics
Keywords
Achievements
About
Methodology
← Methods
Reinforcement Learning
›
Methods
›
Policy Learning
2068 directly classified papers
Papers per year
2002: 6
2003: 1
2004: 1
2006: 11
2007: 10
2008: 14
2009: 9
2010: 23
2011: 15
2012: 25
2013: 25
2014: 24
2015: 23
2016: 27
2017: 61
2018: 107
2019: 187
2020: 216
2021: 274
2022: 259
2023: 321
2024: 247
2025: 153
2026: 29
Papers
Off-Policy Average Reward Actor-Critic with Deterministic Policy Search
ICML 2023
Spotlight News Driven Quantitative Trading Based on Trajectory Optimization
IJCAI 2023
ConvLab-3: A Flexible Dialogue System Toolkit Based on a Unified Data Format
EMNLP 2023
Model Predictive Control via On-Policy Imitation Learning
L4DC 2023
Building Persona Consistent Dialogue Agents with Offline Reinforcement Learning
EMNLP 2023
InitLight: Initial Model Generation for Traffic Signal Control Using Adversarial Inverse Reinforcement Learning
IJCAI 2023
Posterior Sampling for Deep Reinforcement Learning
ICML 2023
Revisiting Bellman Errors for Offline Model Selection
ICML 2023
On the Occupancy Measure of Non-Markovian Policies in Continuous MDPs
ICML 2023
Policy Regularization with Dataset Constraint for Offline Reinforcement Learning
ICML 2023
Towards Hierarchical Policy Learning for Conversational Recommendation with Hypergraph-based Reinforcement Learning
IJCAI 2023
Learning Compiler Pass Orders using Coreset and Normalized Value Prediction
ICML 2023
Nearly Minimax Optimal Reinforcement Learning for Linear Markov Decision Processes
ICML 2023
A Pragmatic Look at Deep Imitation Learning
ACML 2023
Logarithmic regret in communicating MDPs: Leveraging known dynamics with bandits
ACML 2023
Beyond Reward: Offline Preference-guided Policy Optimization
ICML 2023
Sample Efficient Model-free Reinforcement Learning from LTL Specifications with Optimality Guarantees
IJCAI 2023
Modified Policy Iteration for Exponential Cost Risk Sensitive MDPs
L4DC 2023
A Reinforcement Learning Look at Risk-Sensitive Linear Quadratic Gaussian Control
L4DC 2023
Practical Critic Gradient based Actor Critic for On-Policy Reinforcement Learning
L4DC 2023
Provable Hierarchy-Based Meta-Reinforcement Learning
AISTATS 2023
Reinforcement Learning with Stepwise Fairness Constraints
AISTATS 2023
Continuous Versatile Jumping Using Learned Action Residuals
L4DC 2023
Hierarchical State Abstraction based on Structural Information Principles
IJCAI 2023
On The Convergence Of Policy Iteration-Based Reinforcement Learning With Monte Carlo Policy Evaluation
AISTATS 2023
<
1
…
21
22
23
…
83
>