Research Explorer
Papers
Trends
Conferences
Explore
Authors
Topics
Keywords
Papers
Trends
Conferences
Explore
Authors
Topics
Keywords
Achievements
About
Methodology
← Learning Types
Machine Learning
›
Learning Types
›
Reinforcement Learning
2932 directly classified papers
Papers per year
2003: 1
2006: 11
2007: 18
2008: 23
2009: 14
2010: 22
2011: 24
2012: 34
2013: 26
2014: 24
2015: 14
2016: 23
2017: 79
2018: 182
2019: 255
2020: 284
2021: 333
2022: 319
2023: 315
2024: 457
2025: 419
2026: 55
Papers
Variance Reduced Policy Evaluation with Smooth Function Approximation
NIPS 2019
Privacy-Preserving Q-Learning with Functional Noise in Continuous Spaces
NIPS 2019
Learning Data Manipulation for Augmentation and Weighting
NIPS 2019
Learning from Trajectories via Subgoal Discovery
NIPS 2019
Loaded DiCE: Trading off Bias and Variance in Any-Order Score Function Gradient Estimators for Reinforcement Learning
NIPS 2019
Adaptive Temporal-Difference Learning for Policy Evaluation with Per-State Uncertainty Estimates
NIPS 2019
Hierarchical Reinforcement Learning with Advantage-Based Auxiliary Rewards
NIPS 2019
Near-Optimal Reinforcement Learning in Dynamic Treatment Regimes
NIPS 2019
Limiting Extrapolation in Linear Approximate Value Iteration
NIPS 2019
Provably Global Convergence of Actor-Critic: A Case for Linear Quadratic Regulator with Ergodic Cost
NIPS 2019
Importance Resampling for Off-policy Prediction
NIPS 2019
Regret Bounds for Learning State Representations in Reinforcement Learning
NIPS 2019
Policy Learning for Fairness in Ranking
NIPS 2019
Regret Minimization for Reinforcement Learning by Evaluating the Optimal Bias Function
NIPS 2019
Reconciling λ-Returns with Experience Replay
NIPS 2019
Finite-Sample Analysis for SARSA with Linear Function Approximation
NIPS 2019
Regret Minimization for Reinforcement Learning with Vectorial Feedback and Complex Objectives
NIPS 2019
Online Stochastic Shortest Path with Bandit Feedback and Unknown Transition Function
NIPS 2019
A Family of Robust Stochastic Operators for Reinforcement Learning
NIPS 2019
Trust Region-Guided Proximal Policy Optimization
NIPS 2019
Almost Horizon-Free Structure-Aware Best Policy Identification with a Generative Model
NIPS 2019
Finite-time Analysis of Approximate Policy Iteration for the Linear Quadratic Regulator
NIPS 2019
Beyond Confidence Regions: Tight Bayesian Ambiguity Sets for Robust MDPs
NIPS 2019
Goal-conditioned Imitation Learning
NIPS 2019
Game Design for Eliciting Distinguishable Behavior
NIPS 2019
<
1
…
89
90
91
…
118
>