conftrace
Neural Network Optimization
3,648 papers
Papers per year
2001: 1
2003: 1
2005: 2
2006: 3
2007: 6
2008: 1
2009: 7
2010: 5
2011: 7
2012: 9
2013: 17
2014: 18
2015: 40
2016: 76
2017: 113
2018: 214
2019: 324
2020: 414
2021: 489
2022: 445
2023: 524
2024: 469
2025: 386
2026: 77
Papers
CAILA: Concept-Aware Intra-Layer Adapters for Compositional Zero-Shot Learning
WACV 2024
MixtureGrowth: Growing Neural Networks by Recombining Learned Parameters
WACV 2024
Pruning From Scratch via Shared Pruning Module and Nuclear Norm-Based Regularization
WACV 2024
Neural Echos: Depthwise Convolutional Filters Replicate Biological Receptive Fields
WACV 2024
FLORA: Fine-Grained Low-Rank Architecture Search for Vision Transformer
WACV 2024
Wino Vidi Vici: Conquering Numerical Instability of 8-Bit Winograd Convolution for Accurate Inference Acceleration on Edge
WACV 2024
Towards Better Structured Pruning Saliency by Reorganizing Convolution
WACV 2024
Smoothing the Landscape Boosts the Signal for SGD: Optimal Sample Complexity for Learning Single Index Models
NIPS 2023
Sharpness Minimization Algorithms Do Not Only Minimize Sharpness To Achieve Better Generalization
NIPS 2023
Feature-Learning Networks Are Consistent Across Widths At Realistic Scales
NIPS 2023
On the Overlooked Pitfalls of Weight Decay and How to Mitigate Them: A Gradient-Norm Perspective
NIPS 2023
Fixing the NTK: From Neural Network Linearizations to Exact Convex Programs
NIPS 2023
A Variational Perspective on High-Resolution ODEs
NIPS 2023
Uniform-in-Time Wasserstein Stability Bounds for (Noisy) Stochastic Gradient Descent
NIPS 2023
Winner-Take-All Column Row Sampling for Memory Efficient Adaptation of Language Model
NIPS 2023
A Computationally Efficient Sparsified Online Newton Method
NIPS 2023
DoWG Unleashed: An Efficient Universal Parameter-Free Gradient Descent Method
NIPS 2023
Saddle-to-Saddle Dynamics in Diagonal Linear Networks
NIPS 2023
Accelerated Quasi-Newton Proximal Extragradient: Faster Rate for Smooth Convex Optimization
NIPS 2023
How Does Adaptive Optimization Impact Local Neural Network Geometry?
NIPS 2023
Minimum norm interpolation by perceptra: Explicit regularization and implicit bias
NIPS 2023
The Double-Edged Sword of Implicit Bias: Generalization vs. Robustness in ReLU Networks
NIPS 2023
Inconsistency, Instability, and Generalization Gap of Deep Neural Network Training
NIPS 2023
Dynamics of Finite Width Kernel and Prediction Fluctuations in Mean Field Neural Networks
NIPS 2023
On Single-Index Models beyond Gaussian Data
NIPS 2023