Jalaj Bhandari
5 papers · 2017–2025 · 4 conferences · across top CS/AI conferences
Achievements
Jump to papers ↓+1 more ↓ Show less ↑
🌉 Interdisciplinary Bridge 🌍 Conference Polyglot (4) 🏃 Academic Marathon (8) 🐝 Cross-Pollinator (8) 🧭 Keyword Pioneer
🐣
Hot Topic Early Bird
Conferences
AISTATS (2)
COLT (1)
ICML (1)
JMLR (1)
Top co-authors
Keywords
markov decision process
(2)
reinforcement learning
(2)
policy optimization
(1)
policy gradient
(1)
belief propagation
(1)
convergence analysis
(1)
policy learning
(1)
ising model
(1)
markov chain monte carlo
(1)
value function
(1)
policy iteration
(1)
linear function approximation
(1)
partial observability
(1)
auxiliary variable
(1)
linear convergence
(1)
nonlinear optimization
(1)
finite mdp
(1)
finite time analysis
(1)
production deployment
(1)
binary model
(1)
Papers
Aligned Multi Objective Optimization
ICML 2025
Pearl: A Production-Ready Reinforcement Learning Agent
JMLR 2024
On the Linear Convergence of Policy Gradient Methods for Finite MDPs
AISTATS 2021
A Finite Time Analysis of Temporal Difference Learning With Linear Function Approximation
COLT 2018
Annular Augmentation Sampling
AISTATS 2017