conftrace
_
Papers
Trends
Conferences
Explore
Authors
Papers
Trends
Conferences
Explore
Authors
Topics
Keywords
Achievements
← Optimization & Theory
Machine Learning
›
Optimization & Theory
›
Neural Network Optimization
3,648 papers
Papers per year
2001: 1
2003: 1
2005: 2
2006: 3
2007: 6
2008: 1
2009: 7
2010: 5
2011: 7
2012: 9
2013: 17
2014: 18
2015: 40
2016: 76
2017: 113
2018: 214
2019: 324
2020: 414
2021: 489
2022: 445
2023: 524
2024: 469
2025: 386
2026: 77
Papers
Dynamics of Finite Width Kernel and Prediction Fluctuations in Mean Field Neural Networks
NIPS 2023
Automatic Integration for Fast and Interpretable Neural Point Processes
L4DC 2023
A Fully First-Order Method for Stochastic Bilevel Optimization
ICML 2023
Implicit Jacobian regularization weighted with impurity of probability output
ICML 2023
On Over-Squashing in Message Passing Neural Networks: The Impact of Width, Depth, and Topology
ICML 2023
Implicit Regularization in Over-Parameterized Support Vector Machine
NIPS 2023
Improved Convergence in High Probability of Clipped Gradient Methods with Heavy Tailed Noise
NIPS 2023
Binarizing Sparse Convolutional Networks for Efficient Point Cloud Analysis
CVPR 2023
Scalable Real-Time Recurrent Learning Using Columnar-Constructive Networks
JMLR 2023
Beyond NTK with Vanilla Gradient Descent: A Mean-Field Analysis of Neural Networks with Polynomial Width, Samples, and Time
NIPS 2023
How Reliable Are AI-Generated-Text Detectors? An Assessment Framework Using Evasive Soft Prompts
EMNLP 2023
AGD: an Auto-switchable Optimizer using Stepwise Gradient Difference for Preconditioning Matrix
NIPS 2023
Accelerated Training via Incrementally Growing Neural Networks using Variance Transfer and Learning Rate Adaptation
NIPS 2023
Feature-Learning Networks Are Consistent Across Widths At Realistic Scales
NIPS 2023
Better Loss Landscape Visualization for Deep Neural Networks with Trajectory Information
ACML 2023
Robust Linear Regression: Gradient-descent, Early-stopping, and Beyond
AISTATS 2023
Achieving $\mathcal{O}(\epsilon^{-1.5})$ Complexity in Hessian/Jacobian-free Stochastic Bilevel Optimization
NIPS 2023
A Theoretical Analysis of the Learning Dynamics under Class Imbalance
ICML 2023
Intractability of Learning the Discrete Logarithm with Gradient-Based Methods
ACML 2023
Hard Sample Aware Prompt-Tuning
ACL 2023
Continuous-time Analysis of Anchor Acceleration
NIPS 2023
On the spectral bias of two-layer linear networks
NIPS 2023
Generalization bounds for neural ordinary differential equations and deep residual networks
NIPS 2023
Why Random Pruning Is All We Need to Start Sparse
ICML 2023
DINER: Disorder-Invariant Implicit Neural Representation
CVPR 2023
<
1
…
39
40
41
…
146
>