conftrace
_
Papers
Trends
Conferences
Explore
Authors
Papers
Trends
Conferences
Explore
Authors
Topics
Keywords
Achievements
← Optimization & Theory
Machine Learning
›
Optimization & Theory
›
Neural Network Optimization
3,648 papers
Papers per year
2001: 1
2003: 1
2005: 2
2006: 3
2007: 6
2008: 1
2009: 7
2010: 5
2011: 7
2012: 9
2013: 17
2014: 18
2015: 40
2016: 76
2017: 113
2018: 214
2019: 324
2020: 414
2021: 489
2022: 445
2023: 524
2024: 469
2025: 386
2026: 77
Papers
AlignAtt: Using Attention-based Audio-Translation Alignments as a Guide for Simultaneous Speech Translation
INTERSPEECH 2023
Deep Reinforcement Learning with Plasticity Injection
NIPS 2023
Self-Attentive Pooling for Efficient Deep Learning
WACV 2023
Intractability of Learning the Discrete Logarithm with Gradient-Based Methods
ACML 2023
Learning-Rate-Free Learning by D-Adaptation
ICML 2023
Efficient Hyper-parameter Optimization with Cubic Regularization
NIPS 2023
Polarity Is All You Need to Learn and Transfer Faster
ICML 2023
(S)GD over Diagonal Linear Networks: Implicit bias, Large Stepsizes and Edge of Stability
NIPS 2023
Continuous-time Analysis of Anchor Acceleration
NIPS 2023
Generalization bounds for neural ordinary differential equations and deep residual networks
NIPS 2023
MixPath: A Unified Approach for One-shot Neural Architecture Search
ICCV 2023
Separable Physics-Informed Neural Networks
NIPS 2023
Non-autoregressive Streaming Transformer for Simultaneous Translation
EMNLP 2023
Capturing the Long-Distance Dependency in the Control Flow Graph via Structural-Guided Attention for Bug Localization
IJCAI 2023
Fixing the NTK: From Neural Network Linearizations to Exact Convex Programs
NIPS 2023
Toward Edge-Efficient Dense Predictions With Synergistic Multi-Task Neural Architecture Search
WACV 2023
Recurrence Without Recurrence: Stable Video Landmark Detection With Deep Equilibrium Models
CVPR 2023
Understanding, Predicting and Better Resolving Q-Value Divergence in Offline-RL
NIPS 2023
SGD with Large Step Sizes Learns Sparse Features
ICML 2023
Improving Multi-Fidelity Optimization With a Recurring Learning Rate for Hyperparameter Tuning
WACV 2023
Neural Architecture Search for Parameter-Efficient Fine-tuning of Large Pre-trained Language Models
ACL 2023
Finding the Pillars of Strength for Multi-Head Attention
ACL 2023
Accelerated Training via Incrementally Growing Neural Networks using Variance Transfer and Learning Rate Adaptation
NIPS 2023
Growing a Brain with Sparsity-Inducing Generation for Continual Learning
ICCV 2023
Decentralized Learning: Theoretical Optimality and Practical Improvements
JMLR 2023
<
1
…
40
41
42
…
146
>