Offline RL
726 papers
Papers per year (bar chart; values in the order shown): 2, 1, 1, 1, 2, 3, 2, 6, 4, 8, 29, 60, 105, 129, 187, 126, 37, 23. X-axis ticks: '15, '20, '25.
Papers
Self-Imitation Learning
ICML 2018
Toward Minimax Off-policy Value Estimation
AISTATS 2015
Regularized Off-Policy TD-Learning
NIPS 2012
The Fixed Points of Off-Policy TD
NIPS 2011