Neural Network Optimization

3,648 papers

Papers

Implicit Regularization with Polynomial Growth in Deep Tensor Factorization ICML 2022

DyRep: Bootstrapping Training With Dynamic Re-Parameterization CVPR 2022

Accelerated Zeroth-Order and First-Order Momentum Methods from Mini to Minimax Optimization JMLR 2022

Deep equilibrium networks are sensitive to initialization statistics ICML 2022

Towards Understanding Sharpness-Aware Minimization ICML 2022

Robust Training of Neural Networks Using Scale Invariant Architectures ICML 2022

Efficient Adversarial Training with Robust Early-Bird Tickets EMNLP 2022

Improving Neural Ordinary Differential Equations with Nesterov's Accelerated Gradient Method NIPS 2022

BitFit: Simple Parameter-efficient Fine-tuning for Transformer-based Masked Language-models ACL 2022

High-dimensional limit theorems for SGD: Effective dynamics and critical scaling NIPS 2022

Towards Scaling Difference Target Propagation by Learning Backprop Targets ICML 2022

Recent Advances on Neural Network Pruning at Initialization IJCAI 2022

Model Doctor: A Simple Gradient Aggregation Strategy for Diagnosing and Treating CNN Classifiers AAAI 2022

TO-FLOW: Efficient Continuous Normalizing Flows With Temporal Optimization Adjoint With Moving Speed CVPR 2022

SAPipe: Staleness-Aware Pipeline for Data Parallel DNN Training NIPS 2022

Long-range Sequence Modeling with Predictable Sparse Attention ACL 2022

High Probability Guarantees for Nonconvex Stochastic Gradient Descent with Heavy Tails ICML 2022

End-to-End Neural Speaker Diarization with an Iterative Refinement of Non-Autoregressive Attention-based Attractors INTERSPEECH 2022

Understanding the unstable convergence of gradient descent ICML 2022

A Closer Look at Learned Optimization: Stability, Robustness, and Inductive Biases NIPS 2022

Towards Joint Intent Detection and Slot Filling via Higher-order Attention IJCAI 2022

XPrompt: Exploring the Extreme of Prompt Tuning EMNLP 2022

Learning Distributions Generated by Single-Layer ReLU Networks in the Presence of Arbitrary Outliers NIPS 2022

HNO: High-Order Numerical Architecture for ODE-Inspired Deep Unfolding Networks AAAI 2022

Thrifty Neural Architecture Search for Medical Image Segmentation (Student Abstract) AAAI 2022