Papers
Control Batch Size and Learning Rate to Generalize Well: Theoretical and Empirical Evidence
NeurIPS 2019
Combining Global Sparse Gradients with Local Gradients in Distributed Neural Network Training
EMNLP 2019
SAGA with Arbitrary Sampling
ICML 2019
SGD: General Analysis and Improved Rates
ICML 2019
DoubleSqueeze: Parallel Stochastic Gradient Descent with Double-pass Error-Compensated Compression
ICML 2019