conftrace
Neural Network Optimization
3,648 papers
Papers per year
2001: 1
2003: 1
2005: 2
2006: 3
2007: 6
2008: 1
2009: 7
2010: 5
2011: 7
2012: 9
2013: 17
2014: 18
2015: 40
2016: 76
2017: 113
2018: 214
2019: 324
2020: 414
2021: 489
2022: 445
2023: 524
2024: 469
2025: 386
2026: 77
Papers
Nyström Method for Accurate and Scalable Implicit Differentiation
AISTATS 2023
AlphaD3M: An Open-Source AutoML Library for Multiple ML Tasks
AUTOML 2023
Gradient Descent in Neural Networks as Sequential Learning in Reproducing Kernel Banach Space
ICML 2023
Physics-Informed Model-Based Reinforcement Learning
L4DC 2023
Towards Deep Attention in Graph Neural Networks: Problems and Remedies
ICML 2023
PPG Reloaded: An Empirical Study on What Matters in Phasic Policy Gradient
ICML 2023
Provable Advantage of Curriculum Learning on Parity Targets with Mixed Inputs
NIPS 2023
A General Regret Bound of Preconditioned Gradient Method for DNN Training
CVPR 2023
Searching for Robust Binary Neural Networks via Bimodal Parameter Perturbation
WACV 2023
Learning Physics-Informed Neural Networks without Stacked Back-propagation
AISTATS 2023
How much does Initialization Affect Generalization?
ICML 2023
On the Overlooked Structure of Stochastic Gradients
NIPS 2023
Understanding Plasticity in Neural Networks
ICML 2023
Don’t blame Dataset Shift! Shortcut Learning due to Gradients and Cross Entropy
NIPS 2023
Gradient Norm Aware Minimization Seeks First-Order Flatness and Improves Generalization
CVPR 2023
Principled Weight Initialisation for Input-Convex Neural Networks
NIPS 2023
Analyzing Convergence in Quantum Neural Networks: Deviations from Neural Tangent Kernels
ICML 2023
Fast Convergence in Learning Two-Layer Neural Networks with Separable Data
AAAI 2023
Trained MT Metrics Learn to Cope with Machine-translated References
EMNLP 2023
When to Use Efficient Self Attention? Profiling Text, Speech and Image Transformer Variants
ACL 2023
Super-Resolution Neural Operator
CVPR 2023
Why Does Sharpness-Aware Minimization Generalize Better Than SGD?
NIPS 2023
Dynamic Tensor Decomposition via Neural Diffusion-Reaction Processes
NIPS 2023
CircuitNet: A Generic Neural Network to Realize Universal Circuit Motif Modeling
ICML 2023
SAAL: Sharpness-Aware Active Learning
ICML 2023