stochastic gradient descent
1088 papers
Also known as
SGD
ASGD
SAGA
SGM
SGDA
PSGD
SKGD
Co-occurring keywords
Papers
Positive-Negative Momentum: Manipulating Stochastic Gradient Noise to Improve Generalization
ICML 2021
When All We Need is a Piece of the Pie: A Generic Framework for Optimizing Two-way Partial AUC
ICML 2021
Exponential Convergence Rates of Classification Errors on Learning with SGD and Random Features
AISTATS 2021
vqSGD: Vector Quantized Stochastic Gradient Descent
AISTATS 2021
(Nearly) Dimension Independent Private ERM with AdaGrad Rates\{via Publicly Estimated Subspaces
COLT 2021
CADA: Communication-Adaptive Distributed Adam
AISTATS 2021
Elastic Consistency: A Practical Consistency Model for Distributed Stochastic Gradient Descent
AAAI 2021
Is SGD a Bayesian sampler? Well, almost
JMLR 2021