Research Explorer
Reinforcement Learning
2932 directly classified papers
Papers per year
2003: 1
2006: 11
2007: 18
2008: 23
2009: 14
2010: 22
2011: 24
2012: 34
2013: 26
2014: 24
2015: 14
2016: 23
2017: 79
2018: 182
2019: 255
2020: 284
2021: 333
2022: 319
2023: 315
2024: 457
2025: 419
2026: 55
Papers
Organizing Experience: a Deeper Look at Replay Mechanisms for Sample-Based Planning in Continuous State Domains
IJCAI 2018
Scalable Initial State Interdiction for Factored MDPs
IJCAI 2018
Dynamic Resource Routing using Real-Time Dynamic Programming
IJCAI 2018
Autonomously Reusing Knowledge in Multiagent Reinforcement Learning
IJCAI 2018
Interactive Learning and Decision Making: Foundations, Insights & Challenges
IJCAI 2018
Improving Reinforcement Learning with Human Input
IJCAI 2018
Towards Sample Efficient Reinforcement Learning
IJCAI 2018
Coordinated Exploration in Concurrent Reinforcement Learning
ICML 2018
Smoothed Action Value Functions for Learning Gaussian Policies
ICML 2018
Time Limits in Reinforcement Learning
ICML 2018
Learning by Playing - Solving Sparse Reward Tasks from Scratch
ICML 2018
An Inference-Based Policy Gradient Method for Learning Options
ICML 2018
Structured Control Nets for Deep Reinforcement Learning
ICML 2018
Learning Safe Policies with Expert Guidance
NIPS 2018
Learning Globally Optimized Object Detector via Policy Gradient
CVPR 2018
Geometrically Coupled Monte Carlo Sampling
NIPS 2018
Reinforcement Learning with Multiple Experts: A Bayesian Model Combination Approach
NIPS 2018
Differentiable MPC for End-to-end Planning and Control
NIPS 2018
Near-Optimal Time and Sample Complexities for Solving Markov Decision Processes with a Generative Model
NIPS 2018
A Deep Bayesian Policy Reuse Approach Against Non-Stationary Agents
NIPS 2018
Variational Inverse Control with Events: A General Framework for Data-Driven Reward Definition
NIPS 2018
Is Q-Learning Provably Efficient?
NIPS 2018
Exploration in Structured Reinforcement Learning
NIPS 2018
Deep Generative Models with Learnable Knowledge Constraints
NIPS 2018
Near-Optimal Policies for Dynamic Multinomial Logit Assortment Selection Models
NIPS 2018