conftrace
Neural Network Optimization
3,648 papers
Papers per year
2001: 1
2003: 1
2005: 2
2006: 3
2007: 6
2008: 1
2009: 7
2010: 5
2011: 7
2012: 9
2013: 17
2014: 18
2015: 40
2016: 76
2017: 113
2018: 214
2019: 324
2020: 414
2021: 489
2022: 445
2023: 524
2024: 469
2025: 386
2026: 77
Papers
JEPOO: Highly Accurate Joint Estimation of Pitch, Onset and Offset for Music Information Retrieval
IJCAI 2023
Scaling Laws for Multilingual Neural Machine Translation
ICML 2023
DoWG Unleashed: An Efficient Universal Parameter-Free Gradient Descent Method
NIPS 2023
Lookaround Optimizer: $k$ steps around, 1 step average
NIPS 2023
Symbolic Discovery of Optimization Algorithms
NIPS 2023
Over-parameterized Deep Nonparametric Regression for Dependent Data with Its Applications to Reinforcement Learning
JMLR 2023
Meta-AdaM: An Meta-Learned Adaptive Optimizer with Momentum for Few-Shot Learning
NIPS 2023
Backpropagation-Free Deep Learning with Recursive Local Representation Alignment
AAAI 2023
STay-ON-the-Ridge: Guaranteed Convergence to Local Minimax Equilibrium in Nonconvex-Nonconcave Games
COLT 2023
Neural Architecture Search for Parameter-Efficient Fine-tuning of Large Pre-trained Language Models
ACL 2023
Transfer Learning Enhanced DeepONet for Long-Time Prediction of Evolution Equations
AAAI 2023
Learned Adapters Are Better Than Manually Designed Adapters
ACL 2023
ConDaFormer: Disassembled Transformer with Local Structure Enhancement for 3D Point Cloud Understanding
NIPS 2023
Bifurcations and loss jumps in RNN training
NIPS 2023
A Variational Perspective on High-Resolution ODEs
NIPS 2023
Japanese-to-English Simultaneous Dubbing Prototype
ACL 2023
Backpropagation of Unrolled Solvers with Folded Optimization
IJCAI 2023
Implicit Stochastic Gradient Descent for Training Physics-Informed Neural Networks
AAAI 2023
Deep linear networks can benignly overfit when shallow ones do
JMLR 2023
Can Forward Gradient Match Backpropagation?
ICML 2023
Convergence of Proximal Point and Extragradient-Based Methods Beyond Monotonicity: the Case of Negative Comonotonicity
ICML 2023
A More Accurate Internal Language Model Score Estimation for the Hybrid Autoregressive Transducer
INTERSPEECH 2023
Intriguing Properties of Quantization at Scale
NIPS 2023
Smoothing the Landscape Boosts the Signal for SGD: Optimal Sample Complexity for Learning Single Index Models
NIPS 2023
A future for universal grapheme-phoneme transduction modeling with neuralized finite-state transducers
ACL 2023