stochastic gradient descent
1088 papers
Also known as
SGD
ASGD
SAGA
SGM
SGDA
PSGD
SKGD
Co-occurring keywords
Papers
Leveraging Continuous Time to Understand Momentum When Training Diagonal Linear Networks
AISTATS 2024
Fast Forwarding Low-Rank Training
EMNLP 2024
A Bregman Proximal Stochastic Gradient Method with Extrapolation for Nonconvex Nonsmooth Problems
AAAI 2024
Removing Data Heterogeneity Influence Enhances Network Topology Dependence of Decentralized SGD
JMLR 2023
Mixtures of All Trees
AISTATS 2023
A General Theory for Federated Optimization with Asynchronous and Heterogeneous Clients Updates
JMLR 2023