2025 ICML ICML 2025

Action-Dependent Optimality-Preserving Reward Shaping