Research Explorer
Papers
Trends
Conferences
Explore
Authors
Topics
Keywords
Papers
Trends
Conferences
Explore
Authors
Topics
Keywords
Achievements
About
Methodology
← Optimization & Theory
Deep Learning
›
Optimization & Theory
›
Neural Network Optimization
902 directly classified papers
Papers per year
2007: 1
2009: 1
2010: 2
2011: 1
2012: 3
2013: 4
2014: 1
2015: 9
2016: 14
2017: 20
2018: 30
2019: 66
2020: 127
2021: 106
2022: 117
2023: 106
2024: 190
2025: 100
2026: 4
Papers
A Modern Look at the Relationship between Sharpness and Generalization
ICML 2023
SGD with Large Step Sizes Learns Sparse Features
ICML 2023
AdaNorm: Adaptive Gradient Norm Correction Based Optimizer for CNNs
WACV 2023
DoG is SGD’s Best Friend: A Parameter-Free Dynamic Step Size Schedule
ICML 2023
Can Forward Gradient Match Backpropagation?
ICML 2023
Achieving High Accuracy with PINNs via Energy Natural Gradient Descent
ICML 2023
Understanding Representation Learnability of Nonlinear Self-Supervised Learning
AAAI 2023
The Implicit Regularization of Momentum Gradient Descent in Overparametrized Models
AAAI 2023
Fast Saturating Gate for Learning Long Time Scales with Recurrent Neural Networks
AAAI 2023
Fast Convergence in Learning Two-Layer Neural Networks with Separable Data
AAAI 2023
Experimental Observations of the Topology of Convolutional Neural Network Activations
AAAI 2023
Zero-Cost Operation Scoring in Differentiable Architecture Search
AAAI 2023
Backpropagation-Free Deep Learning with Recursive Local Representation Alignment
AAAI 2023
AIO-P: Expanding Neural Performance Predictors beyond Image Classification
AAAI 2023
A Length-Extrapolatable Transformer
ACL 2023
DMIS: Dynamic Mesh-Based Importance Sampling for Training Physics-Informed Neural Networks
AAAI 2023
Can We Find Strong Lottery Tickets in Generative Models?
AAAI 2023
Self-Evolution Learning for Discriminative Language Model Pretraining
ACL 2023
A Unified Approach to Controlling Implicit Regularization via Mirror Descent
JMLR 2023
Improved Powered Stochastic Optimization Algorithms for Large-Scale Machine Learning
JMLR 2023
An Inexact Augmented Lagrangian Algorithm for Training Leaky ReLU Neural Network with Group Sparsity
JMLR 2023
Implicit Bias of Gradient Descent for Mean Squared Error Regression with Two-Layer Wide Neural Networks
JMLR 2023
On the Convergence of Stochastic Gradient Descent with Bandwidth-based Step Size
JMLR 2023
Regression as Classification: Influence of Task Formulation on Neural Network Features
AISTATS 2023
Curvature-Balanced Feature Manifold Learning for Long-Tailed Classification
CVPR 2023
<
1
…
13
14
15
…
37
>