Deep RL
3,886 papers
Papers per year
[Bar chart; x-axis ticks '05, '10, '15, '20, '25. Yearly counts in order: 1, 9, 14, 15, 9, 21, 27, 32, 21, 17, 10, 33, 102, 222, 399, 450, 533, 478, 532, 513, 326, 122]
Papers
Value Pursuit Iteration (NIPS 2012)
Multi-objective Monte-Carlo Tree Search (ACML 2012)
Contextual Bandit Learning with Predictable Rewards (AISTATS 2012)
Optimistic planning for Markov decision processes (AISTATS 2012)
Policy Gradient Coagent Networks (NIPS 2011)
Speedy Q-Learning (NIPS 2011)