Research Explorer
Papers
Trends
Conferences
Explore
Authors
Topics
Keywords
Papers
Trends
Conferences
Explore
Authors
Topics
Keywords
Achievements
About
Methodology
← Optimization & Theory
Machine Learning
›
Optimization & Theory
›
Neural Network Optimization
3648 directly classified papers
Papers per year
2001: 1
2003: 1
2005: 2
2006: 3
2007: 6
2008: 1
2009: 7
2010: 5
2011: 7
2012: 9
2013: 17
2014: 18
2015: 40
2016: 76
2017: 113
2018: 214
2019: 324
2020: 414
2021: 489
2022: 445
2023: 524
2024: 469
2025: 386
2026: 77
Papers
Convergence Analysis of Gradient Descent for Eigenvector Computation
IJCAI 2018
Refine or Represent: Residual Networks with Explicit Channel-wise Configuration
IJCAI 2018
Stochastic Second-Order Method for Large-Scale Nonconvex Sparse Learning Models
IJCAI 2018
The Context-Dependent Additive Recurrent Neural Net
NAACL 2018
Dense Information Flow for Neural Machine Translation
NAACL 2018
Lyapunov Functions for First-Order Methods: Tight Automated Convergence Guarantees
ICML 2018
Stochastic Variance-Reduced Cubic Regularized Newton Methods
ICML 2018
On the Optimization of Deep Networks: Implicit Acceleration by Overparameterization
ICML 2018
Comparing Dynamics: Deep Neural Networks versus Glassy Systems
ICML 2018
signSGD: Compressed Optimisation for Non-Convex Problems
ICML 2018
A Progressive Batching L-BFGS Method for Machine Learning
ICML 2018
Dynamical Isometry and a Mean Field Theory of RNNs: Gating Enables Signal Propagation in Recurrent Neural Networks
ICML 2018
Orthogonal Recurrent Neural Networks with Scaled Cayley Transform
ICML 2018
On the Implicit Bias of Dropout
ICML 2018
Stabilizing Gradients for Deep Neural Networks via Efficient SVD Parameterization
ICML 2018
Learning Long Term Dependencies via Fourier Recurrent Units
ICML 2018
Optimization based Layer-wise Magnitude-based Pruning for DNN Compression
IJCAI 2018
Escaping Saddles with Stochastic Gradients
ICML 2018
Experienced Optimization with Reusable Directional Model for Hyper-Parameter Search
IJCAI 2018
Neural Ranking Models for Temporal Dependency Structure Parsing
EMNLP 2018
PiCANet: Learning Pixel-Wise Contextual Attention for Saliency Detection
CVPR 2018
A PID Controller Approach for Stochastic Optimization of Deep Networks
CVPR 2018
Progressive Blockwise Knowledge Distillation for Neural Network Acceleration
IJCAI 2018
Scale-Recurrent Network for Deep Image Deblurring
CVPR 2018
MorphNet: Fast & Simple Resource-Constrained Structure Learning of Deep Networks
CVPR 2018
<
1
…
128
129
130
…
146
>