Shangding Gu
5 papers · 2021–2025 · 4 conferences · across top CS/AI conferences
Achievements
Renaissance Researcher (5)
Keyword Pioneer
Cross-Pollinator (10)
Conference Polyglot (4)
Interdisciplinary Bridge
Taxonomy Completionist (14)
Mega-Team (32)
Conferences
NIPS (2)
AAAI (1)
EMNLP (1)
ICLR (1)
Keywords
policy optimization (2)
safe reinforcement learning (2)
benchmark evaluation (1)
policy gradient (1)
multilingual nlp (1)
convergence analysis (1)
constraint satisfaction (1)
variance reduction (1)
mirror descent (1)
optimal baseline (1)
low-resource language (1)
convergence guarantee (1)
gradient estimator (1)
cost minimization (1)
gradient manipulation (1)
large language model (1)
multi-agent policy gradient (1)
reward safety optimization (1)
policy adjustment (1)
cross-lingual reasoning (1)
Papers
MMLU-ProX: A Multilingual Benchmark for Advanced Large Language Model Evaluation
EMNLP 2025
Robust Gymnasium: A Unified Modular Benchmark for Robust Reinforcement Learning
ICLR 2025
Enhancing Efficiency of Safe Reinforcement Learning via Sample Manipulation
NIPS 2024
Balance Reward and Safety Optimization for Safe Reinforcement Learning: A Perspective of Gradient Manipulation
AAAI 2024
Settling the Variance of Multi-Agent Policy Gradients
NIPS 2021