conftrace
_
Papers
Trends
Conferences
Explore
Authors
Papers
Trends
Conferences
Explore
Authors
Topics
Keywords
Achievements
← Optimization & Theory
Machine Learning
›
Optimization & Theory
›
Neural Network Optimization
3,648 papers
Papers per year
2001: 1
2003: 1
2005: 2
2006: 3
2007: 6
2008: 1
2009: 7
2010: 5
2011: 7
2012: 9
2013: 17
2014: 18
2015: 40
2016: 76
2017: 113
2018: 214
2019: 324
2020: 414
2021: 489
2022: 445
2023: 524
2024: 469
2025: 386
2026: 77
Papers
A study on constraining Connectionist Temporal Classification for temporal audio alignment
INTERSPEECH 2022
End-to-End Neural Speaker Diarization with an Iterative Refinement of Non-Autoregressive Attention-based Attractors
INTERSPEECH 2022
Deep Learning in Target Space
JMLR 2022
A Stochastic Bundle Method for Interpolation
JMLR 2022
On Biased Stochastic Gradient Estimation
JMLR 2022
Accelerated Zeroth-Order and First-Order Momentum Methods from Mini to Minimax Optimization
JMLR 2022
Beyond Sub-Gaussian Noises: Sharp Concentration Analysis for Stochastic Gradient Descent
JMLR 2022
Overparameterization of Deep ResNet: Zero Loss and Mean-field Analysis
JMLR 2022
Batch Normalization Preconditioning for Neural Network Training
JMLR 2022
Accelerating Adaptive Cubic Regularization of Newton's Method via Random Sampling
JMLR 2022
Reverse-mode differentiation in arbitrary tensor network format: with application to supervised learning
JMLR 2022
Implicit Differentiation for Fast Hyperparameter Selection in Non-Smooth Convex Learning
JMLR 2022
Learning Rates as a Function of Batch Size: A Random Matrix Theory Approach to Neural Network Training
JMLR 2022
Training Two-Layer ReLU Networks with Gradient Descent is Inconsistent
JMLR 2022
Learning to Optimize: A Primer and A Benchmark
JMLR 2022
Learning Operators with Coupled Attention
JMLR 2022
Towards Practical Adam: Non-Convexity, Convergence Theory, and Mini-Batch Acceleration
JMLR 2022
Simple and Optimal Stochastic Gradient Methods for Nonsmooth Nonconvex Optimization
JMLR 2022
On Constraints in First-Order Optimization: A View from Non-Smooth Dynamical Systems
JMLR 2022
A proof of convergence for the gradient descent optimization method with random initializations in the training of neural networks with ReLU activation for piecewise linear target functions
JMLR 2022
Oracle Complexity in Nonsmooth Nonconvex Optimization
JMLR 2022
Early Stopping for Iterative Regularization with General Loss Functions
JMLR 2022
OpReg-Boost: Learning to Accelerate Online Algorithms with Operator Regression
L4DC 2022
Total Energy Shaping with Neural Interconnection and Damping Assignment - Passivity Based Control
L4DC 2022
Input-to-State Stable Neural Ordinary Differential Equations with Applications to Transient Modeling of Circuits
L4DC 2022
<
1
…
74
75
76
…
146
>