conftrace
Neural Network Optimization
3,648 papers
Papers per year
2001: 1
2003: 1
2005: 2
2006: 3
2007: 6
2008: 1
2009: 7
2010: 5
2011: 7
2012: 9
2013: 17
2014: 18
2015: 40
2016: 76
2017: 113
2018: 214
2019: 324
2020: 414
2021: 489
2022: 445
2023: 524
2024: 469
2025: 386
2026: 77
Papers
A Stochastic Momentum Accelerated Quasi-Newton Method for Neural Networks (Student Abstract)
AAAI 2022
NeuralArTS: Structuring Neural Architecture Search with Type Theory (Student Abstract)
AAAI 2022
Understanding Stochastic Optimization Behavior at the Layer Update Level (Student Abstract)
AAAI 2022
The Importance of Hyperparameter Optimisation for Facial Recognition Applications
AAAI 2022
Characterizing and addressing the issue of oversmoothing in neural autoregressive sequence modeling
AACL 2022
Rare Tokens Degenerate All Tokens: Improving Neural Text Generation via Adaptive Gradient Gating for Rare Token Embeddings
ACL 2022
Long-range Sequence Modeling with Predictable Sparse Attention
ACL 2022
Composable Sparse Fine-Tuning for Cross-Lingual Transfer
ACL 2022
Probing Structured Pruning on Multilingual Pre-trained Models: Settings, Algorithms, and Efficiency
ACL 2022
Robust Lottery Tickets for Pre-trained Language Models
ACL 2022
Confidence Based Bidirectional Global Context Aware Training Framework for Neural Machine Translation
ACL 2022
Boundary Smoothing for Named Entity Recognition
ACL 2022
Overcoming a Theoretical Limitation of Self-Attention
ACL 2022
PPT: Pre-trained Prompt Tuning for Few-shot Learning
ACL 2022
BitFit: Simple Parameter-efficient Fine-tuning for Transformer-based Masked Language-models
ACL 2022
A Gentle Introduction to Deep Nets and Opportunities for the Future
ACL 2022
Finding the Dominant Winning Ticket in Pre-Trained Language Models
ACL 2022
Modality-specific Learning Rates for Effective Multimodal Additive Late-fusion
ACL 2022
A Simple Hash-Based Early Exiting Approach For Language Understanding and Generation
ACL 2022
Predicting Attention Sparsity in Transformers
ACL 2022
On the Convergence of Decentralized Adaptive Gradient Methods
ACML 2022
Dynamic Forward and Backward Sparse Training (DFBST): Accelerated Deep Learning through Completely Sparse Training Schedule
ACML 2022
AFRNN: Stable RNN with Top Down Feedback and Antisymmetry
ACML 2022
Neural Contextual Bandits without Regret
AISTATS 2022
Finding Dynamics Preserving Adversarial Winning Tickets
AISTATS 2022