Research Explorer
Papers
Trends
Conferences
Explore
Authors
Topics
Keywords
Papers
Trends
Conferences
Explore
Authors
Topics
Keywords
Achievements
About
Methodology
← Optimization & Theory
Deep Learning
›
Optimization & Theory
›
Neural Network Optimization
902 directly classified papers
Papers per year
2007: 1
2009: 1
2010: 2
2011: 1
2012: 3
2013: 4
2014: 1
2015: 9
2016: 14
2017: 20
2018: 30
2019: 66
2020: 127
2021: 106
2022: 117
2023: 106
2024: 190
2025: 100
2026: 4
Papers
Generalization Performance of Multi-pass Stochastic Gradient Descent with Convex Loss Functions
JMLR 2021
Accelerating Ill-Conditioned Low-Rank Matrix Estimation via Scaled Gradient Descent
JMLR 2021
On ADMM in Deep Learning: Convergence and Saturation-Avoidance
JMLR 2021
Multiplicative Noise and Heavy Tails in Stochastic Optimization
ICML 2021
Orthogonal Over-Parameterized Training
CVPR 2021
Rethinking BiSeNet for Real-Time Semantic Segmentation
CVPR 2021
Network Quantization With Element-Wise Gradient Scaling
CVPR 2021
Train simultaneously, generalize better: Stability of gradient-based minimax learners
ICML 2021
Attention is not all you need: pure attention loses rank doubly exponentially with depth
ICML 2021
On Energy-Based Models with Overparametrized Shallow Neural Networks
ICML 2021
Adversarial Robustness Guarantees for Random Deep Neural Networks
ICML 2021
Bayesian Deep Learning via Subnetwork Inference
ICML 2021
On the Adequacy of Untuned Warmup for Adaptive Optimization
AAAI 2021
ReconfigISP: Reconfigurable Camera Image Processing Pipeline
ICCV 2021
ReZero is all you need: fast convergence at large depth
UAI 2021
Sparse linear networks with a fixed butterfly structure: theory and practice
UAI 2021
Certifying Incremental Quadratic Constraints for Neural Networks via Convex Optimization
L4DC 2021
Spectral Normalisation for Deep Reinforcement Learning: An Optimisation Perspective
ICML 2021
Zero Time Waste: Recycling Predictions in Early Exit Neural Networks
NIPS 2021
The Heavy-Tail Phenomenon in SGD
ICML 2021
ASAM: Adaptive Sharpness-Aware Minimization for Scale-Invariant Learning of Deep Neural Networks
ICML 2021
Differentiable Spike: Rethinking Gradient-Descent for Training Spiking Neural Networks
NIPS 2021
Loss Surface Simplexes for Mode Connecting Volumes and Fast Ensembling
ICML 2021
Better Training using Weight-Constrained Stochastic Dynamics
ICML 2021
On Monotonic Linear Interpolation of Neural Network Parameters
ICML 2021
<
1
…
23
24
25
…
37
>