conftrace_

← Optimization & Theory

Machine Learning › Optimization & Theory ›

Neural Network Optimization

3,648 papers

Papers per year

Papers

CAILA: Concept-Aware Intra-Layer Adapters for Compositional Zero-Shot Learning WACV 2024

MixtureGrowth: Growing Neural Networks by Recombining Learned Parameters WACV 2024

Pruning From Scratch via Shared Pruning Module and Nuclear Norm-Based Regularization WACV 2024

Neural Echos: Depthwise Convolutional Filters Replicate Biological Receptive Fields WACV 2024

FLORA: Fine-Grained Low-Rank Architecture Search for Vision Transformer WACV 2024

Wino Vidi Vici: Conquering Numerical Instability of 8-Bit Winograd Convolution for Accurate Inference Acceleration on Edge WACV 2024

Towards Better Structured Pruning Saliency by Reorganizing Convolution WACV 2024

Smoothing the Landscape Boosts the Signal for SGD: Optimal Sample Complexity for Learning Single Index Models NIPS 2023

Sharpness Minimization Algorithms Do Not Only Minimize Sharpness To Achieve Better Generalization NIPS 2023

Feature-Learning Networks Are Consistent Across Widths At Realistic Scales NIPS 2023

On the Overlooked Pitfalls of Weight Decay and How to Mitigate Them: A Gradient-Norm Perspective NIPS 2023

Fixing the NTK: From Neural Network Linearizations to Exact Convex Programs NIPS 2023

A Variational Perspective on High-Resolution ODEs NIPS 2023

Uniform-in-Time Wasserstein Stability Bounds for (Noisy) Stochastic Gradient Descent NIPS 2023

Winner-Take-All Column Row Sampling for Memory Efficient Adaptation of Language Model NIPS 2023

A Computationally Efficient Sparsified Online Newton Method NIPS 2023

DoWG Unleashed: An Efficient Universal Parameter-Free Gradient Descent Method NIPS 2023

Saddle-to-Saddle Dynamics in Diagonal Linear Networks NIPS 2023

Accelerated Quasi-Newton Proximal Extragradient: Faster Rate for Smooth Convex Optimization NIPS 2023

How Does Adaptive Optimization Impact Local Neural Network Geometry? NIPS 2023

Minimum norm interpolation by perceptra: Explicit regularization and implicit bias NIPS 2023

The Double-Edged Sword of Implicit Bias: Generalization vs. Robustness in ReLU Networks NIPS 2023

Inconsistency, Instability, and Generalization Gap of Deep Neural Network Training NIPS 2023

Dynamics of Finite Width Kernel and Prediction Fluctuations in Mean Field Neural Networks NIPS 2023

On Single-Index Models beyond Gaussian Data NIPS 2023