Qinbo Bai
7 papers · 2021–2024 · 5 conferences · across top CS/AI conferences
Achievements
Jump to papers ↓
π§
Keyword Pioneer
π
Conference Polyglot
(5)
π
Cross-Pollinator
(6)
π
Interdisciplinary Bridge
πΊοΈ
Taxonomy Completionist
(12)
Conferences
AAAI (3)
AISTATS (1)
JMLR (1)
NIPS (1)
UAI (1)
Top co-authors
Keywords
constrained markov decision process
(6)
sample complexity
(3)
policy gradient
(2)
regret bound
(2)
markov decision process
(2)
zero constraint violation
(2)
average reward
(2)
primal-dual algorithm
(2)
reinforcement learning
(2)
constraint violation
(2)
multi-objective optimization
(1)
zero-sum game
(1)
bandit optimization
(1)
primal-dual method
(1)
multi-objective reinforcement learning
(1)
model-free algorithm
(1)
peak constraint
(1)
posterior sampling algorithm
(1)
regret bound analysis
(1)
long-term average constraint
(1)
Papers
Regret Analysis of Policy Gradient Algorithm for Infinite Horizon Average Reward Markov Decision Processes
AAAI 2024
Learning General Parameterized Policies for Infinite Horizon Average Reward Constrained MDPs via Primal-Dual Policy Gradient Algorithm
NIPS 2024
Achieving Zero Constraint Violation for Constrained Reinforcement Learning via Conservative Natural Policy Gradient Primal-Dual Algorithm
AAAI 2023
Provably Sample-Efficient Model-Free Algorithm for MDPs with Peak Constraints
JMLR 2023
Regret guarantees for model-based reinforcement learning with long-term average constraints
UAI 2022
Achieving Zero Constraint Violation for Constrained Reinforcement Learning via Primal-Dual Approach
AAAI 2022
Reinforcement Learning for Constrained Markov Decision Processes
AISTATS 2021