Yaozhong Gan
8 papers · 2019–2026 · 5 top CS/AI conferences
Achievements
🌉 Interdisciplinary Bridge 🌍 Conference Polyglot (5) 🏃 Academic Marathon (6) 🐝 Cross-Pollinator (14) 🧭 Keyword Pioneer
🐣 Hot Topic Early Bird
🏆 Grand Slam
Conferences
AAAI (4)
ICCV (1)
ICLR (1)
ICML (1)
NIPS (1)
Keywords
advantage learning (2)
action gap (2)
temporal difference (2)
policy optimization (2)
reinforcement learning (2)
multi-agent reinforcement learning (2)
proximal policy optimization (1)
diffusion policy (1)
training stability (1)
trust region (1)
denoising process (1)
quality-diversity trade-off (1)
value-based reinforcement learning (1)
soft mellowmax operator (1)
q learning (1)
mellowmax operator (1)
bellman optimal operator (1)
clipped advantage (1)
value convergence (1)
action-gap regularization (1)
Papers
MARPO: A Reflective Policy Optimization for Multi-Agent Reinforcement Learning
AAAI 2026
Entropy-Adaptive Diffusion Policy Optimization with Dynamic Step Alignment
ICCV 2025
PAE: Reinforcement Learning from External Knowledge for Efficient Exploration
ICLR 2024
Reflective Policy Optimization
ICML 2024
Smoothing Advantage Learning
AAAI 2022
Robust Action Gap Increasing with Clipped Advantage Learning
AAAI 2022
Stabilizing Q Learning Via Soft Mellowmax Operator
AAAI 2021
Trust Region-Guided Proximal Policy Optimization
NIPS 2019