Yixiu Mao
8 papers · 2021–2025 · 4 conferences · across top CS/AI conferences
Achievements
Jump to papers ↓+2 more ↓ Show less ↑
🌍 Conference Polyglot (4) 🧭 Keyword Pioneer 🐝 Cross-Pollinator (14) 🌉 Interdisciplinary Bridge 🏆 Keyword Champion (3)
🏆
Grand Slam
👑
Triple Crown
Conferences
NIPS (3)
ICLR (2)
ICML (2)
AAAI (1)
Top co-authors
Keywords
offline reinforcement learning
(4)
out-of-distribution action
(3)
extrapolation error
(3)
policy learning
(2)
dynamic programming
(1)
policy iteration
(1)
action selection
(1)
policy improvement
(1)
behavior policy
(1)
value overestimation
(1)
in-sample learning
(1)
mild generalization
(1)
out-of-distribution state
(1)
credit assignment
(1)
episodic reinforcement learning
(1)
trust region policy optimization
(1)
large language model
(1)
reward redistribution
(1)
support constraint
(1)
latent reward
(1)
Papers
Fast and Robust: Task Sampling with Posterior and Diversity Synergies for Adaptive Decision-Makers in Randomized Environments
ICML 2025
Latent Reward: LLM-Empowered Credit Assignment in Episodic Reinforcement Learning
AAAI 2025
Doubly Mild Generalization for Offline Reinforcement Learning
NIPS 2024
Offline Reinforcement Learning with OOD State Correction and OOD Action Suppression
NIPS 2024
Supported Value Regularization for Offline Reinforcement Learning
NIPS 2023
In-sample Actor Critic for Offline Reinforcement Learning
ICLR 2023
Supported Trust Region Optimization for Offline Reinforcement Learning
ICML 2023
A Hypergradient Approach to Robust Regression without Correspondence
ICLR 2021