temporal difference learning
149 papers
Also known as
TD
TD LEARNING
TD-LEARNING
LSTD
GTD
Co-occurring keywords
Papers
Sample Complexity of Policy-Based Methods under Off-Policy Sampling and Linear Function Approximation
AISTATS 2022
Stochastic linear optimization never overfits with quadratically-bounded losses on general data
COLT 2022
A Generalized Projected Bellman Error for Off-policy Value Estimation in Reinforcement Learning
JMLR 2022
Expected Eligibility Traces
AAAI 2021