stochastic gradient descent
1088 papers
Also known as
SGD
ASGD
SAGA
SGM
SGDA
PSGD
SKGD
Papers
On the Convergence of (Stochastic) Gradient Descent with Extrapolation for Non-Convex Minimization
IJCAI 2019
Two Tiered Distributed Training Algorithm for Acoustic Modeling
INTERSPEECH 2019
Trading Redundancy for Communication: Speeding up Distributed SGD for Non-convex Optimization
ICML 2019
Combining Global Sparse Gradients with Local Gradients in Distributed Neural Network Training
IJCNLP 2019