conftrace
Neural Network Optimization
3,648 papers
Papers per year
2001: 1
2003: 1
2005: 2
2006: 3
2007: 6
2008: 1
2009: 7
2010: 5
2011: 7
2012: 9
2013: 17
2014: 18
2015: 40
2016: 76
2017: 113
2018: 214
2019: 324
2020: 414
2021: 489
2022: 445
2023: 524
2024: 469
2025: 386
2026: 77
Papers
FlatMatch: Bridging Labeled Data and Unlabeled Data with Cross-Sharpness for Semi-Supervised Learning
NIPS 2023
Self-Attentive Pooling for Efficient Deep Learning
WACV 2023
Local Temperature Beam Search: Avoid Neural Text DeGeneration via Enhanced Calibration
ACL 2023
Rethinking the Role of Scale for In-Context Learning: An Interpretability-based Case Study at 66 Billion Scale
ACL 2023
ATHENA: Mathematical Reasoning with Thought Expansion
EMNLP 2023
RecursiveDet: End-to-End Region-Based Recursive Object Detection
ICCV 2023
The Double-Edged Sword of Implicit Bias: Generalization vs. Robustness in ReLU Networks
NIPS 2023
Beyond Lipschitz Smoothness: A Tighter Analysis for Nonconvex Optimization
ICML 2023
Learning-Rate-Free Learning by D-Adaptation
ICML 2023
On the Convergence of Continual Learning with Adaptive Methods
UAI 2023
Learning an Explicit Hyper-parameter Prediction Function Conditioned on Tasks
JMLR 2023
MixPath: A Unified Approach for One-shot Neural Architecture Search
ICCV 2023
Stability-Based Generalization Analysis of the Asynchronous Decentralized SGD
AAAI 2023
Fundamental Limits of Two-layer Autoencoders, and Achieving Them with Gradient Methods
ICML 2023
Accelerated Training via Incrementally Growing Neural Networks using Variance Transfer and Learning Rate Adaptation
NIPS 2023
Fixing the NTK: From Neural Network Linearizations to Exact Convex Programs
NIPS 2023
Global Convergence Analysis of Local SGD for Two-layer Neural Network without Overparameterization
NIPS 2023
Fast and Robust Early-Exiting Framework for Autoregressive Language Models with Synchronized Parallel Decoding
EMNLP 2023
Training biologically plausible recurrent neural networks on cognitive tasks with long-term dependencies
NIPS 2023
GeNAS: Neural Architecture Search with Better Generalization
IJCAI 2023
SERF: Towards Better Training of Deep Neural Networks Using Log-Softplus ERror Activation Function
WACV 2023
On the Dynamics Under the Unhinged Loss and Beyond
JMLR 2023
ROME: Robustifying Memory-Efficient NAS via Topology Disentanglement and Gradient Accumulation
ICCV 2023
Temperature Balancing, Layer-wise Weight Analysis, and Neural Network Training
NIPS 2023
Continuous Spatiotemporal Transformer
ICML 2023