Neural Network Optimization

3,648 papers

Papers

FlatMatch: Bridging Labeled Data and Unlabeled Data with Cross-Sharpness for Semi-Supervised Learning NIPS 2023

Self-Attentive Pooling for Efficient Deep Learning WACV 2023

Local Temperature Beam Search: Avoid Neural Text DeGeneration via Enhanced Calibration ACL 2023

Rethinking the Role of Scale for In-Context Learning: An Interpretability-based Case Study at 66 Billion Scale ACL 2023

ATHENA: Mathematical Reasoning with Thought Expansion EMNLP 2023

RecursiveDet: End-to-End Region-Based Recursive Object Detection ICCV 2023

The Double-Edged Sword of Implicit Bias: Generalization vs. Robustness in ReLU Networks NIPS 2023

Beyond Lipschitz Smoothness: A Tighter Analysis for Nonconvex Optimization ICML 2023

Learning-Rate-Free Learning by D-Adaptation ICML 2023

On the Convergence of Continual Learning with Adaptive Methods UAI 2023

Learning an Explicit Hyper-parameter Prediction Function Conditioned on Tasks JMLR 2023

MixPath: A Unified Approach for One-shot Neural Architecture Search ICCV 2023

Stability-Based Generalization Analysis of the Asynchronous Decentralized SGD AAAI 2023

Fundamental Limits of Two-layer Autoencoders, and Achieving Them with Gradient Methods ICML 2023

Accelerated Training via Incrementally Growing Neural Networks using Variance Transfer and Learning Rate Adaptation NIPS 2023

Fixing the NTK: From Neural Network Linearizations to Exact Convex Programs NIPS 2023

Global Convergence Analysis of Local SGD for Two-layer Neural Network without Overparameterization NIPS 2023

Fast and Robust Early-Exiting Framework for Autoregressive Language Models with Synchronized Parallel Decoding EMNLP 2023

Training biologically plausible recurrent neural networks on cognitive tasks with long-term dependencies NIPS 2023

GeNAS: Neural Architecture Search with Better Generalization IJCAI 2023

SERF: Towards Better Training of Deep Neural Networks Using Log-Softplus ERror Activation Function WACV 2023

On the Dynamics Under the Unhinged Loss and Beyond JMLR 2023

ROME: Robustifying Memory-Efficient NAS via Topology Disentanglement and Gradient Accumulation ICCV 2023

Temperature Balancing, Layer-wise Weight Analysis, and Neural Network Training NIPS 2023

Continuous Spatiotemporal Transformer ICML 2023