Washim Uddin Mondal
9 papers · 2022–2025 · 6 conferences · across top CS/AI conferences
Achievements
Jump to papers ↓+2 more ↓ Show less ↑
π Cross-Pollinator (11) π Interdisciplinary Bridge π Conference Polyglot (6) π§ Keyword Pioneer π Renaissance Researcher (5)
πΊοΈ
Taxonomy Completionist
(12)
β
The Questioner
Conferences
JMLR (2)
NIPS (2)
UAI (2)
AAAI (1)
AISTATS (1)
ICML (1)
Top co-authors
Keywords
natural policy gradient
(4)
multi-agent reinforcement learning
(3)
average reward
(2)
policy gradient
(2)
sample complexity
(2)
markov decision process
(2)
constrained markov decision process
(2)
regret bound
(2)
cooperative multi-agent
(2)
constraint violation
(2)
mean field control
(2)
global optimal policy
(1)
primal-dual method
(1)
sample efficiency
(1)
cooperative multi-agent system
(1)
non-uniform interaction
(1)
heterogeneous agent
(1)
approximation guarantee
(1)
doubly stochastic matrix
(1)
global optimality
(1)
Papers
Order-Optimal Global Convergence for Actor-Critic with General Policy and Neural Critic Parametrization
UAI 2025
Order-Optimal Regret with Novel Policy Gradient Approaches in Infinite-Horizon Average Reward MDPs
AISTATS 2025
A Sharper Global Convergence Analysis for Average Reward Reinforcement Learning via an Actor-Critic Approach
ICML 2025
Sample-Efficient Constrained Reinforcement Learning with General Parameterization
NIPS 2024
Mean-Field Approximation of Cooperative Constrained Multi-Agent Reinforcement Learning (CMARL)
JMLR 2024
Learning General Parameterized Policies for Infinite Horizon Average Reward Constrained MDPs via Primal-Dual Policy Gradient Algorithm
NIPS 2024
Regret Analysis of Policy Gradient Algorithm for Infinite Horizon Average Reward Markov Decision Processes
AAAI 2024
Can mean field control (mfc) approximate cooperative multi agent reinforcement learning (marl) with non-uniform interaction?
UAI 2022
On the Approximation of Cooperative Heterogeneous Multi-Agent Reinforcement Learning (MARL) using Mean Field Control (MFC)
JMLR 2022