haiyin piao
4 papers · 2021–2024 · 2 conferences · across top CS/AI conferences
Achievements
Jump to papers ↓+2 more ↓ Show less ↑
πΊοΈ Taxonomy Completionist (10) π Interdisciplinary Bridge π Cross-Pollinator (15) π Conference Polyglot (2) π§ Keyword Pioneer
π£
Hot Topic Early Bird
π
Trend Setter
Conferences
AAAI (2)
NIPS (2)
Top co-authors
Keywords
multi-agent reinforcement learning
(2)
proximal policy optimization
(2)
policy gradient
(2)
uncertainty quantification
(1)
policy learning
(1)
multi-agent learning
(1)
continuous control
(1)
off-policy learning
(1)
policy improvement
(1)
reward uncertainty
(1)
exploration strategy
(1)
optimism-based exploration
(1)
optimism in face of uncertainty
(1)
credit assignment
(1)
reward estimation
(1)
conservative policy iteration
(1)
monotonic improvement
(1)
value distribution
(1)
noise-aware exploration
(1)
soft clipping
(1)
Papers
OVD-Explorer: Optimism Should Not Be the Sole Pursuit of Exploration in Noisy Environments
AAAI 2024
The Sufficiency of Off-Policyness and Soft Clipping: PPO Is Still Insufficient according to an Off-Policy Measure
AAAI 2023
Distributional Reward Estimation for Effective Multi-agent Deep Reinforcement Learning
NIPS 2022
Coordinated Proximal Policy Optimization
NIPS 2021