Co-occurring keywords
reinforcement learning (4122)
temporal difference learning (149)
value function (294)
offline reinforcement learning (492)
causal inference (1619)
function approximation (319)
off-policy learning (227)
markov decision process (788)
temporal-difference learning (42)
linear function approximation (101)
Papers
Policy Evaluation in Distributional LQR
L4DC 2023
Provably Fast Convergence of Independent Natural Policy Gradient for Markov Potential Games
NeurIPS 2023
Learning Good State and Action Representations for Markov Decision Process via Tensor Decomposition
JMLR 2023
On the Assumptions of Synthetic Control Methods
AISTATS 2022
Offline Policy Selection under Uncertainty
AISTATS 2022
Sample Complexity of Policy-Based Methods under Off-Policy Sampling and Linear Function Approximation
AISTATS 2022