Neural Network Optimization
3,648 papers
Papers per year
2001: 1
2003: 1
2005: 2
2006: 3
2007: 6
2008: 1
2009: 7
2010: 5
2011: 7
2012: 9
2013: 17
2014: 18
2015: 40
2016: 76
2017: 113
2018: 214
2019: 324
2020: 414
2021: 489
2022: 445
2023: 524
2024: 469
2025: 386
2026: 77
Papers
FastMem: Fast Memorization of Prompt Improves Context Awareness of Large Language Models
EMNLP 2024
HiFT: A Hierarchical Full Parameter Fine-Tuning Strategy
EMNLP 2024
Recurrent neural networks: vanishing and exploding gradients are not the end of the story
NIPS 2024
Deep Learning for Computing Convergence Rates of Markov Chains
NIPS 2024
The Feature Speed Formula: a flexible approach to scale hyper-parameters of deep neural networks
NIPS 2024
Parameter-Agnostic Optimization under Relaxed Smoothness
AISTATS 2024
Large Stepsize Gradient Descent for Non-Homogeneous Two-Layer Networks: Margin Improvement and Fast Optimization
NIPS 2024
Improving Neural Network Generalization on Data-Limited Regression with Doubly-Robust Boosting
AAAI 2024
Chain-of-Thought Reasoning Without Prompting
NIPS 2024
Investigating Acceleration of LLaMA Inference by Enabling Intermediate Layer Decoding via Instruction Tuning with ‘LITE’
NAACL 2024
Taming Nonconvex Stochastic Mirror Descent with General Bregman Divergence
AISTATS 2024
Stochastic Modified Flows, Mean-Field Limits and Dynamics of Stochastic Gradient Descent
JMLR 2024
Tending Towards Stability: Convergence Challenges in Small Language Models
EMNLP 2024
Adaptive Sharpness-Aware Pruning for Robust Sparse Networks
ICLR 2024
Reasoning Paths Optimization: Learning to Reason and Explore From Diverse Paths
EMNLP 2024
Ordered Momentum for Asynchronous SGD
NIPS 2024
A Structure-Aware Framework for Learning Device Placements on Computation Graphs
NIPS 2024
Separation and Bias of Deep Equilibrium Models on Expressivity and Learning Dynamics
NIPS 2024
Vision Mamba Mender
NIPS 2024
Linguistic Fingerprint in Transformer Models: How Language Variation Influences Parameter Selection in Irony Detection
COLING 2024
Beyond Slow Signs in High-fidelity Model Extraction
NIPS 2024
Batch Normalization Is Blind to the First and Second Derivatives of the Loss
AAAI 2024
Understanding Linear Probing then Fine-tuning Language Models from NTK Perspective
NIPS 2024
Beyond Fine-tuning: Unleashing the Potential of Continuous Pretraining for Clinical LLMs.
EMNLP 2024
Preparing Lessons for Progressive Training on Language Models
AAAI 2024