conftrace
_
Papers
Trends
Conferences
Explore
Authors
Papers
Trends
Conferences
Explore
Authors
Topics
Keywords
Achievements
← Optimization & Theory
Machine Learning
›
Optimization & Theory
›
Neural Network Optimization
3,648 papers
Papers per year
2001: 1
2003: 1
2005: 2
2006: 3
2007: 6
2008: 1
2009: 7
2010: 5
2011: 7
2012: 9
2013: 17
2014: 18
2015: 40
2016: 76
2017: 113
2018: 214
2019: 324
2020: 414
2021: 489
2022: 445
2023: 524
2024: 469
2025: 386
2026: 77
Papers
Algorithmic Stability of Heavy-Tailed SGD with General Loss Functions
ICML 2023
How much does Initialization Affect Generalization?
ICML 2023
Feature learning in deep classifiers through Intermediate Neural Collapse
ICML 2023
Neural networks trained with SGD learn distributions of increasing complexity
ICML 2023
End-to-End Learning for Stochastic Optimization: A Bayesian Perspective
ICML 2023
Dissecting the Effects of SGD Noise in Distinct Regimes of Deep Learning
ICML 2023
Fundamental Limits of Two-layer Autoencoders, and Achieving Them with Gradient Methods
ICML 2023
Gradient Descent in Neural Networks as Sequential Learning in Reproducing Kernel Banach Space
ICML 2023
The Dormant Neuron Phenomenon in Deep Reinforcement Learning
ICML 2023
Dynamic Regularized Sharpness Aware Minimization in Federated Learning: Approaching Global Consistency and Smooth Landscape
ICML 2023
Momentum Ensures Convergence of SIGNSGD under Weaker Assumptions
ICML 2023
A Neural PDE Solver with Temporal Stencil Modeling
ICML 2023
Learning Neural PDE Solvers with Parameter-Guided Channel Attention
ICML 2023
Perturbation Analysis of Neural Collapse
ICML 2023
Expected Gradients of Maxout Networks and Consequences to Parameter Initialization
ICML 2023
Low-Variance Gradient Estimation in Unrolled Computation Graphs with ES-Single
ICML 2023
Adaptive Smoothing Gradient Learning for Spiking Neural Networks
ICML 2023
CircuitNet: A Generic Neural Network to Realize Universal Circuit Motif Modeling
ICML 2023
Generalized Polyak Step Size for First Order Optimization with Momentum
ICML 2023
Polarity Is All You Need to Learn and Transfer Faster
ICML 2023
Robustly Learning a Single Neuron via Sharpness
ICML 2023
PPG Reloaded: An Empirical Study on What Matters in Phasic Policy Gradient
ICML 2023
NTK-approximating MLP Fusion for Efficient Language Model Fine-tuning
ICML 2023
Optimizing Mode Connectivity for Class Incremental Learning
ICML 2023
The Implicit Regularization of Dynamical Stability in Stochastic Gradient Descent
ICML 2023
<
1
…
54
55
56
…
146
>