Policy Learning
2,076 papers
Papers per year
6
1
1
11
10
14
9
23
15
25
25
24
23
27
61
107
187
216
274
259
321
247
153
37
'10
'15
'20
'25
Papers
Reward-Constrained Behavior Cloning
IJCAI 2021
Hindsight Trust Region Policy Optimization
IJCAI 2021
Independence-aware Advantage Estimation
IJCAI 2021
Causal Confusion Reduction for Robust Multi-Domain Dialogue Policy
INTERSPEECH 2021