Research Explorer
Papers
Trends
Conferences
Explore
Authors
Topics
Keywords
Papers
Trends
Conferences
Explore
Authors
Topics
Keywords
Achievements
About
Methodology
← Optimization & Theory
Machine Learning
›
Optimization & Theory
›
Neural Network Optimization
3648 directly classified papers
Papers per year
2001: 1
2003: 1
2005: 2
2006: 3
2007: 6
2008: 1
2009: 7
2010: 5
2011: 7
2012: 9
2013: 17
2014: 18
2015: 40
2016: 76
2017: 113
2018: 214
2019: 324
2020: 414
2021: 489
2022: 445
2023: 524
2024: 469
2025: 386
2026: 77
Papers
A Modular Analysis of Adaptive (Non-)Convex Optimization: Optimism, Composite Objectives, and Variational Bounds
ALT 2017
Gradient Descent Can Take Exponential Time to Escape Saddle Points
NIPS 2017
Nonlinear Acceleration of Stochastic Algorithms
NIPS 2017
Fast Black-box Variational Inference through Stochastic Trust-Region Optimization
NIPS 2017
Deep Neural Machine Translation with Linear Associative Unit
ACL 2017
Annealed f-Smoothing as a Mechanism to Speed up Neural Network Training
INTERSPEECH 2017
Training Deep Networks without Learning Rates Through Coin Betting
NIPS 2017
The Marginal Value of Adaptive Gradient Methods in Machine Learning
NIPS 2017
Train longer, generalize better: closing the generalization gap in large batch training of neural networks
NIPS 2017
Convergence Analysis of Two-layer Neural Networks with ReLU Activation
NIPS 2017
Joint Training of Expanded End-to-End DNN for Text-Dependent Speaker Verification
INTERSPEECH 2017
A Nested Attention Neural Hybrid Model for Grammatical Error Correction
ACL 2017
Towards Generalization and Simplicity in Continuous Control
NIPS 2017
Runtime Neural Pruning
NIPS 2017
Plan, Attend, Generate: Planning for Sequence-to-Sequence Models
NIPS 2017
All You Need Is Beyond a Good Init: Exploring Better Solution for Training Extremely Deep Convolutional Neural Networks With Orthonormality and Modulation
CVPR 2017
Global SNR Estimation of Speech Signals for Unknown Noise Conditions Using Noise Adapted Non-Linear Regression
INTERSPEECH 2017
Unit Selection with Hierarchical Cascaded Long Short Term Memory Bidirectional Recurrent Neural Nets
INTERSPEECH 2017
On orthogonality and learning recurrent networks with long term dependencies
ICML 2017
More Is Less: A More Complicated Network With Less Inference Complexity
CVPR 2017
Discrete Duration Model for Speech Synthesis
INTERSPEECH 2017
Robustness Over Time-Varying Channels in DNN-HMM ASR Based Human-Robot Interaction
INTERSPEECH 2017
Recurrent Neural Aligner: An Encoder-Decoder Neural Network Model for Sequence to Sequence Mapping
INTERSPEECH 2017
Preventing Gradient Explosions in Gated Recurrent Units
NIPS 2017
Noisy Softmax: Improving the Generalization Ability of DCNN via Postponing the Early Softmax Saturation
CVPR 2017
<
1
…
134
135
136
…
146
>