← Optimization & Theory

Machine Learning › Optimization & Theory ›

Neural Network Optimization

3648 directly classified papers

Papers per year

Papers

A Modular Analysis of Adaptive (Non-)Convex Optimization: Optimism, Composite Objectives, and Variational Bounds ALT 2017

Gradient Descent Can Take Exponential Time to Escape Saddle Points NIPS 2017

Nonlinear Acceleration of Stochastic Algorithms NIPS 2017

Fast Black-box Variational Inference through Stochastic Trust-Region Optimization NIPS 2017

Deep Neural Machine Translation with Linear Associative Unit ACL 2017

Annealed f-Smoothing as a Mechanism to Speed up Neural Network Training INTERSPEECH 2017

Training Deep Networks without Learning Rates Through Coin Betting NIPS 2017

The Marginal Value of Adaptive Gradient Methods in Machine Learning NIPS 2017

Train longer, generalize better: closing the generalization gap in large batch training of neural networks NIPS 2017

Convergence Analysis of Two-layer Neural Networks with ReLU Activation NIPS 2017

Joint Training of Expanded End-to-End DNN for Text-Dependent Speaker Verification INTERSPEECH 2017

A Nested Attention Neural Hybrid Model for Grammatical Error Correction ACL 2017

Towards Generalization and Simplicity in Continuous Control NIPS 2017

Runtime Neural Pruning NIPS 2017

Plan, Attend, Generate: Planning for Sequence-to-Sequence Models NIPS 2017

All You Need Is Beyond a Good Init: Exploring Better Solution for Training Extremely Deep Convolutional Neural Networks With Orthonormality and Modulation CVPR 2017

Global SNR Estimation of Speech Signals for Unknown Noise Conditions Using Noise Adapted Non-Linear Regression INTERSPEECH 2017

Unit Selection with Hierarchical Cascaded Long Short Term Memory Bidirectional Recurrent Neural Nets INTERSPEECH 2017

On orthogonality and learning recurrent networks with long term dependencies ICML 2017

More Is Less: A More Complicated Network With Less Inference Complexity CVPR 2017

Discrete Duration Model for Speech Synthesis INTERSPEECH 2017

Robustness Over Time-Varying Channels in DNN-HMM ASR Based Human-Robot Interaction INTERSPEECH 2017

Recurrent Neural Aligner: An Encoder-Decoder Neural Network Model for Sequence to Sequence Mapping INTERSPEECH 2017

Preventing Gradient Explosions in Gated Recurrent Units NIPS 2017

Noisy Softmax: Improving the Generalization Ability of DCNN via Postponing the Early Softmax Saturation CVPR 2017