conftrace
_
Papers
Trends
Conferences
Explore
Authors
Papers
Trends
Conferences
Explore
Authors
Topics
Keywords
Achievements
← Optimization & Theory
Machine Learning
›
Optimization & Theory
›
Neural Network Optimization
3,648 papers
Papers per year
2001: 1
2003: 1
2005: 2
2006: 3
2007: 6
2008: 1
2009: 7
2010: 5
2011: 7
2012: 9
2013: 17
2014: 18
2015: 40
2016: 76
2017: 113
2018: 214
2019: 324
2020: 414
2021: 489
2022: 445
2023: 524
2024: 469
2025: 386
2026: 77
Papers
Implicit Regularization with Polynomial Growth in Deep Tensor Factorization
ICML 2022
DyRep: Bootstrapping Training With Dynamic Re-Parameterization
CVPR 2022
Accelerated Zeroth-Order and First-Order Momentum Methods from Mini to Minimax Optimization
JMLR 2022
Deep equilibrium networks are sensitive to initialization statistics
ICML 2022
Towards Understanding Sharpness-Aware Minimization
ICML 2022
Robust Training of Neural Networks Using Scale Invariant Architectures
ICML 2022
Efficient Adversarial Training with Robust Early-Bird Tickets
EMNLP 2022
Improving Neural Ordinary Differential Equations with Nesterov's Accelerated Gradient Method
NIPS 2022
BitFit: Simple Parameter-efficient Fine-tuning for Transformer-based Masked Language-models
ACL 2022
High-dimensional limit theorems for SGD: Effective dynamics and critical scaling
NIPS 2022
Towards Scaling Difference Target Propagation by Learning Backprop Targets
ICML 2022
Recent Advances on Neural Network Pruning at Initialization
IJCAI 2022
Model Doctor: A Simple Gradient Aggregation Strategy for Diagnosing and Treating CNN Classifiers
AAAI 2022
TO-FLOW: Efficient Continuous Normalizing Flows With Temporal Optimization Adjoint With Moving Speed
CVPR 2022
SAPipe: Staleness-Aware Pipeline for Data Parallel DNN Training
NIPS 2022
Long-range Sequence Modeling with Predictable Sparse Attention
ACL 2022
High Probability Guarantees for Nonconvex Stochastic Gradient Descent with Heavy Tails
ICML 2022
End-to-End Neural Speaker Diarization with an Iterative Refinement of Non-Autoregressive Attention-based Attractors
INTERSPEECH 2022
Understanding the unstable convergence of gradient descent
ICML 2022
A Closer Look at Learned Optimization: Stability, Robustness, and Inductive Biases
NIPS 2022
Towards Joint Intent Detection and Slot Filling via Higher-order Attention
IJCAI 2022
XPrompt: Exploring the Extreme of Prompt Tuning
EMNLP 2022
Learning Distributions Generated by Single-Layer ReLU Networks in the Presence of Arbitrary Outliers
NIPS 2022
HNO: High-Order Numerical Architecture for ODE-Inspired Deep Unfolding Networks
AAAI 2022
Thrifty Neural Architecture Search for Medical Image Segmentation (Student Abstract)
AAAI 2022
<
1
…
67
68
69
…
146
>