Martin J Wainwright
4 papers · 2018–2022 · 2 conferences · across top CS/AI conferences
Achievements
Jump to papers ↓
π
Conference Polyglot
(2)
π
Interdisciplinary Bridge
πΊοΈ
Taxonomy Completionist
(10)
π§
Keyword Pioneer
π
Cross-Pollinator
(14)
Conferences
COLT (2)
NIPS (2)
Top co-authors
Keywords
offline reinforcement learning
(2)
policy optimization
(1)
function approximation
(1)
markov chain monte carlo
(1)
bellman residual
(1)
polyak-ruppert averaging
(1)
mixing time
(1)
bellman operator
(1)
minimax lower bound
(1)
confidence interval
(1)
log-concave sampling
(1)
langevin diffusion
(1)
metropolis-adjusted langevin algorithm
(1)
central limit theorem
(1)
pessimism principle
(1)
linear stochastic approximation
(1)
temporal difference algorithm
(1)
strongly log-concave density
(1)
non-asymptotic concentration
(1)
off-policy evaluation
(1)
Papers
Bellman Residual Orthogonalization for Offline Reinforcement Learning
NIPS 2022
Provable Benefits of Actor-Critic Methods for Offline Reinforcement Learning
NIPS 2021
On Linear Stochastic Approximation: Fine-grained Polyak-Ruppert and Non-Asymptotic Concentration
COLT 2020
Log-concave sampling: Metropolis-Hastings algorithms are fast!
COLT 2018