conftrace_

← Optimization & Theory

Machine Learning › Optimization & Theory ›

Neural Network Optimization

3,648 papers

Papers per year

Papers

A Stochastic Momentum Accelerated Quasi-Newton Method for Neural Networks (Student Abstract) AAAI 2022

NeuralArTS: Structuring Neural Architecture Search with Type Theory (Student Abstract) AAAI 2022

Understanding Stochastic Optimization Behavior at the Layer Update Level (Student Abstract) AAAI 2022

The Importance of Hyperparameter Optimisation for Facial Recognition Applications AAAI 2022

Characterizing and addressing the issue of oversmoothing in neural autoregressive sequence modeling AACL 2022

Rare Tokens Degenerate All Tokens: Improving Neural Text Generation via Adaptive Gradient Gating for Rare Token Embeddings ACL 2022

Long-range Sequence Modeling with Predictable Sparse Attention ACL 2022

Composable Sparse Fine-Tuning for Cross-Lingual Transfer ACL 2022

Probing Structured Pruning on Multilingual Pre-trained Models: Settings, Algorithms, and Efficiency ACL 2022

Robust Lottery Tickets for Pre-trained Language Models ACL 2022

Confidence Based Bidirectional Global Context Aware Training Framework for Neural Machine Translation ACL 2022

Boundary Smoothing for Named Entity Recognition ACL 2022

Overcoming a Theoretical Limitation of Self-Attention ACL 2022

PPT: Pre-trained Prompt Tuning for Few-shot Learning ACL 2022

BitFit: Simple Parameter-efficient Fine-tuning for Transformer-based Masked Language-models ACL 2022

A Gentle Introduction to Deep Nets and Opportunities for the Future ACL 2022

Finding the Dominant Winning Ticket in Pre-Trained Language Models ACL 2022

Modality-specific Learning Rates for Effective Multimodal Additive Late-fusion ACL 2022

A Simple Hash-Based Early Exiting Approach For Language Understanding and Generation ACL 2022

Predicting Attention Sparsity in Transformers ACL 2022

On the Convergence of Decentralized Adaptive Gradient Methods ACML 2022

Dynamic Forward and Backward Sparse Training (DFBST): Accelerated Deep Learning through Completely Sparse Training Schedule ACML 2022

AFRNN: Stable RNN with Top Down Feedback and Antisymmetry ACML 2022

Neural Contextual Bandits without Regret AISTATS 2022

Finding Dynamics Preserving Adversarial Winning Tickets AISTATS 2022