conftrace_

← Optimization & Theory

Machine Learning › Optimization & Theory ›

Neural Network Optimization

3,648 papers

Papers per year

Papers

Don’t Predict Counterfactual Values, Predict Expected Values Instead AAAI 2023

AutoInit: Analytic Signal-Preserving Weight Initialization for Neural Networks AAAI 2023

On the Stability and Generalization of Triplet Learning AAAI 2023

Stability-Based Generalization Analysis of the Asynchronous Decentralized SGD AAAI 2023

EffConv: Efficient Learning of Kernel Sizes for Convolution Layers of CNNs AAAI 2023

RLEKF: An Optimizer for Deep Potential with Ab Initio Accuracy AAAI 2023

WLD-Reg: A Data-Dependent Within-Layer Diversity Regularizer AAAI 2023

Learning Compact Features via In-Training Representation Alignment AAAI 2023

Implicit Stochastic Gradient Descent for Training Physics-Informed Neural Networks AAAI 2023

Fast Saturating Gate for Learning Long Time Scales with Recurrent Neural Networks AAAI 2023

Backpropagation-Free Deep Learning with Recursive Local Representation Alignment AAAI 2023

Continual Learning with Scaled Gradient Projection AAAI 2023

Fixed-Weight Difference Target Propagation AAAI 2023

Fast Convergence in Learning Two-Layer Neural Networks with Separable Data AAAI 2023

Linear Regularizers Enforce the Strict Saddle Property AAAI 2023

The Implicit Regularization of Momentum Gradient Descent in Overparametrized Models AAAI 2023

Transfer Learning Enhanced DeepONet for Long-Time Prediction of Evolution Equations AAAI 2023

Lottery Pools: Winning More by Interpolating Tickets without Increasing Training or Inference Cost AAAI 2023

Acceleration of Large Transformer Model Training by Sensitivity-Based Layer Dropping AAAI 2023

DRGCN: Dynamic Evolving Initial Residual for Deep Graph Convolutional Networks AAAI 2023

CowClip: Reducing CTR Prediction Model Training Time from 12 Hours to 10 Minutes on 1 GPU AAAI 2023

Diffuser: Efficient Transformers with Multi-Hop Attention Diffusion for Long Sequences AAAI 2023

Demystify the Gravity Well in the Optimization Landscape (Student Abstract) AAAI 2023

Backforward Propagation (Student Abstract) AAAI 2023

Fine-tuning Happens in Tiny Subspaces: Exploring Intrinsic Task-specific Subspaces of Pre-trained Language Models ACL 2023