Co-occurring keywords
Papers
On the Global Convergence of Gradient Descent for Over-parameterized Models using Optimal Transport
NIPS 2018
Step Size Matters in Deep Learning
NIPS 2018
On the Fine-Grained Complexity of Empirical Risk Minimization: Kernel Methods and Neural Networks
NIPS 2017
Repeat before Forgetting: Spaced Repetition for Efficient and Effective Training of Neural Networks
EMNLP 2017
Sobolev Training for Neural Networks
NIPS 2017
Convergent Block Coordinate Descent for Training Tikhonov Regularized Deep Neural Networks
NIPS 2017