Research Explorer
Neural Network Optimization
3648 directly classified papers
Papers per year
2001: 1
2003: 1
2005: 2
2006: 3
2007: 6
2008: 1
2009: 7
2010: 5
2011: 7
2012: 9
2013: 17
2014: 18
2015: 40
2016: 76
2017: 113
2018: 214
2019: 324
2020: 414
2021: 489
2022: 445
2023: 524
2024: 469
2025: 386
2026: 77
Papers
Optimization Theory for ReLU Neural Networks Trained with Normalization Layers
ICML 2020
Zeno++: Robust Fully Asynchronous SGD
ICML 2020
Efficient Derivative Computation for Cumulative B-Splines on Lie Groups
CVPR 2020
Dynamics of Deep Neural Networks and Neural Tangent Hierarchy
ICML 2020
Task-Level Curriculum Learning for Non-Autoregressive Neural Machine Translation
IJCAI 2020
Long Document Ranking with Query-Directed Sparse Transformer
EMNLP 2020
Learning Rank-1 Diffractive Optics for Single-Shot High Dynamic Range Imaging
CVPR 2020
Sideways: Depth-Parallel Training of Video Models
CVPR 2020
Curvature-corrected learning dynamics in deep neural networks
ICML 2020
Do We Need Zero Training Loss After Achieving Zero Training Error?
ICML 2020
SelectScale: Mining More Patterns from Images via Selective and Soft Dropout
IJCAI 2020
Extrapolation for Large-batch Training in Deep Learning
ICML 2020
Filter Grafting for Deep Neural Networks
CVPR 2020
Dynamic Hierarchical Mimicking Towards Consistent Optimization Objectives
CVPR 2020
An Internal Covariate Shift Bounding Algorithm for Deep Neural Networks by Unitizing Layers' Outputs
CVPR 2020
All in One Bad Weather Removal Using Architectural Search
CVPR 2020
Beyond exploding and vanishing gradients: analysing RNN training using attractors and smoothness
AISTATS 2020
Learning Rate Adaptation for Differentially Private Learning
AISTATS 2020
Adversarial Risk Bounds through Sparsity based Compression
AISTATS 2020
The Devil Is in the Details: Delving Into Unbiased Data Processing for Human Pose Estimation
CVPR 2020
Improving Transformer Optimization Through Better Initialization
ICML 2020
AdaScale SGD: A User-Friendly Algorithm for Distributed Training
ICML 2020
Exploiting Neuron and Synapse Filter Dynamics in Spatial Temporal Learning of Deep Spiking Neural Network
IJCAI 2020
Universal Average-Case Optimality of Polyak Momentum
ICML 2020
Adaptive Checkpoint Adjoint Method for Gradient Estimation in Neural ODE
ICML 2020