← Optimization & Theory

Deep Learning › Optimization & Theory ›

Neural Network Optimization

902 directly classified papers

Papers per year

Papers

A Modern Look at the Relationship between Sharpness and Generalization ICML 2023

SGD with Large Step Sizes Learns Sparse Features ICML 2023

AdaNorm: Adaptive Gradient Norm Correction Based Optimizer for CNNs WACV 2023

DoG is SGD’s Best Friend: A Parameter-Free Dynamic Step Size Schedule ICML 2023

Can Forward Gradient Match Backpropagation? ICML 2023

Achieving High Accuracy with PINNs via Energy Natural Gradient Descent ICML 2023

Understanding Representation Learnability of Nonlinear Self-Supervised Learning AAAI 2023

The Implicit Regularization of Momentum Gradient Descent in Overparametrized Models AAAI 2023

Fast Saturating Gate for Learning Long Time Scales with Recurrent Neural Networks AAAI 2023

Fast Convergence in Learning Two-Layer Neural Networks with Separable Data AAAI 2023

Experimental Observations of the Topology of Convolutional Neural Network Activations AAAI 2023

Zero-Cost Operation Scoring in Differentiable Architecture Search AAAI 2023

Backpropagation-Free Deep Learning with Recursive Local Representation Alignment AAAI 2023

AIO-P: Expanding Neural Performance Predictors beyond Image Classification AAAI 2023

A Length-Extrapolatable Transformer ACL 2023

DMIS: Dynamic Mesh-Based Importance Sampling for Training Physics-Informed Neural Networks AAAI 2023

Can We Find Strong Lottery Tickets in Generative Models? AAAI 2023

Self-Evolution Learning for Discriminative Language Model Pretraining ACL 2023

A Unified Approach to Controlling Implicit Regularization via Mirror Descent JMLR 2023

Improved Powered Stochastic Optimization Algorithms for Large-Scale Machine Learning JMLR 2023

An Inexact Augmented Lagrangian Algorithm for Training Leaky ReLU Neural Network with Group Sparsity JMLR 2023

Implicit Bias of Gradient Descent for Mean Squared Error Regression with Two-Layer Wide Neural Networks JMLR 2023

On the Convergence of Stochastic Gradient Descent with Bandwidth-based Step Size JMLR 2023

Regression as Classification: Influence of Task Formulation on Neural Network Features AISTATS 2023

Curvature-Balanced Feature Manifold Learning for Long-Tailed Classification CVPR 2023