Co-occurring keywords
Papers
Stochastic Gradient Succeeds for Bandits
ICML 2023
Stochastic Policy Gradient Methods: Improved Sample Complexity for Fisher-non-degenerate Policies
ICML 2023
On the Global Convergence of Fitted Q-Iteration with Two-layer Neural Network Parametrization
ICML 2023
Over-Parameterization Exponentially Slows Down Gradient Descent for Learning a Single Neuron
COLT 2023
Beyond the Policy Gradient Theorem for Efficient Policy Updates in Actor-Critic Algorithms
AISTATS 2022