Papers
Policy Gradient in Continuous Time
JMLR 2006
Bayesian Policy Gradient Algorithms
NIPS 2006
Least-Squares Policy Iteration
JMLR 2003
Policy Search using Paired Comparisons
JMLR 2002
ε-MDPs: Learning in Varying Environments
JMLR 2002