Neural Network Optimization

3,648 papers

Papers

Mixture of Attention Heads: Selecting Attention Heads Per Token EMNLP 2022

Improving Stability of Fine-Tuning Pretrained Language Models via Component-Wise Gradient Norm Clipping EMNLP 2022

LittleBird: Efficient Faster & Longer Transformer for Question Answering EMNLP 2022

R-TeaFor: Regularized Teacher-Forcing for Abstractive Summarization EMNLP 2022

The Devil in Linear Transformer EMNLP 2022

Efficient Pre-training of Masked Language Model via Concept-based Curriculum Masking EMNLP 2022

STGN: an Implicit Regularization Method for Learning with Noisy Labels in Natural Language Processing EMNLP 2022

Efficient Adversarial Training with Robust Early-Bird Tickets EMNLP 2022

Evaluating the Impact of Model Scale for Compositional Generalization in Semantic Parsing EMNLP 2022

Meta-Learning Fast Weight Language Models EMNLP 2022

Fine-Tuning Pre-trained Transformers into Decaying Fast Weights EMNLP 2022

Sampling-Based Approximations to Minimum Bayes Risk Decoding for Neural Machine Translation EMNLP 2022

XPrompt: Exploring the Extreme of Prompt Tuning EMNLP 2022

Knowledge Distillation based Contextual Relevance Matching for E-commerce Product Search EMNLP 2022

Late Prompt Tuning: A Late Prompt Could Be Better Than Many Prompts EMNLP 2022

Improving Sharpness-Aware Minimization with Fisher Mask for Better Generalization on Language Models EMNLP 2022

Improving Generalization of Pre-trained Language Models via Stochastic Weight Averaging EMNLP 2022

Sharpness-Aware Minimization with Dynamic Reweighting EMNLP 2022

Search to Pass Messages for Temporal Knowledge Graph Completion EMNLP 2022

FPT: Improving Prompt Tuning Efficiency via Progressive Training EMNLP 2022

Scaling Laws Under the Microscope: Predicting Transformer Performance from Small Scale Experiments EMNLP 2022

A Minimal Model for Compositional Generalization on gSCAN EMNLP 2022

Using Deep Mixture-of-Experts to Detect Word Meaning Shift for TempoWiC EMNLP 2022

Exploring Robustness of Prefix Tuning in Noisy Data: A Case Study in Financial Sentiment Analysis EMNLP 2022

ANVITA-African: A Multilingual Neural Machine Translation System for African Languages EMNLP 2022