conftrace
_
Papers
Trends
Conferences
Explore
Authors
Papers
Trends
Conferences
Explore
Authors
Topics
Keywords
Achievements
← Optimization & Theory
Machine Learning
›
Optimization & Theory
›
Neural Network Optimization
3,648 papers
Papers per year
2001: 1
2003: 1
2005: 2
2006: 3
2007: 6
2008: 1
2009: 7
2010: 5
2011: 7
2012: 9
2013: 17
2014: 18
2015: 40
2016: 76
2017: 113
2018: 214
2019: 324
2020: 414
2021: 489
2022: 445
2023: 524
2024: 469
2025: 386
2026: 77
Papers
Don’t Predict Counterfactual Values, Predict Expected Values Instead
AAAI 2023
AutoInit: Analytic Signal-Preserving Weight Initialization for Neural Networks
AAAI 2023
On the Stability and Generalization of Triplet Learning
AAAI 2023
Stability-Based Generalization Analysis of the Asynchronous Decentralized SGD
AAAI 2023
EffConv: Efficient Learning of Kernel Sizes for Convolution Layers of CNNs
AAAI 2023
RLEKF: An Optimizer for Deep Potential with Ab Initio Accuracy
AAAI 2023
WLD-Reg: A Data-Dependent Within-Layer Diversity Regularizer
AAAI 2023
Learning Compact Features via In-Training Representation Alignment
AAAI 2023
Implicit Stochastic Gradient Descent for Training Physics-Informed Neural Networks
AAAI 2023
Fast Saturating Gate for Learning Long Time Scales with Recurrent Neural Networks
AAAI 2023
Backpropagation-Free Deep Learning with Recursive Local Representation Alignment
AAAI 2023
Continual Learning with Scaled Gradient Projection
AAAI 2023
Fixed-Weight Difference Target Propagation
AAAI 2023
Fast Convergence in Learning Two-Layer Neural Networks with Separable Data
AAAI 2023
Linear Regularizers Enforce the Strict Saddle Property
AAAI 2023
The Implicit Regularization of Momentum Gradient Descent in Overparametrized Models
AAAI 2023
Transfer Learning Enhanced DeepONet for Long-Time Prediction of Evolution Equations
AAAI 2023
Lottery Pools: Winning More by Interpolating Tickets without Increasing Training or Inference Cost
AAAI 2023
Acceleration of Large Transformer Model Training by Sensitivity-Based Layer Dropping
AAAI 2023
DRGCN: Dynamic Evolving Initial Residual for Deep Graph Convolutional Networks
AAAI 2023
CowClip: Reducing CTR Prediction Model Training Time from 12 Hours to 10 Minutes on 1 GPU
AAAI 2023
Diffuser: Efficient Transformers with Multi-Hop Attention Diffusion for Long Sequences
AAAI 2023
Demystify the Gravity Well in the Optimization Landscape (Student Abstract)
AAAI 2023
Backforward Propagation (Student Abstract)
AAAI 2023
Fine-tuning Happens in Tiny Subspaces: Exploring Intrinsic Task-specific Subspaces of Pre-trained Language Models
ACL 2023
<
1
…
44
45
46
…
146
>