Dongcui Diao
2 papers · 2009–2023 · 2 conferences · across top CS/AI conferences
Achievements
Interdisciplinary Bridge
Keyword Pioneer
Conference Polyglot (2)
Academic Marathon (14)
Cross-Pollinator (9)
Hot Topic Early Bird
Conferences
AAAI (1)
NIPS (1)
Top co-authors
Keywords
policy evaluation (1)
policy gradient (1)
value function (1)
model-based reinforcement learning (1)
off-policy learning (1)
multi-step planning (1)
dyna architecture (1)
lambda model (1)
proximal policy optimization (1)
conservative policy iteration (1)
multi-step learning (1)
dyna planning (1)
soft clipping (1)