conftrace
Neural Network Optimization
3,648 papers
Papers per year
2001: 1
2003: 1
2005: 2
2006: 3
2007: 6
2008: 1
2009: 7
2010: 5
2011: 7
2012: 9
2013: 17
2014: 18
2015: 40
2016: 76
2017: 113
2018: 214
2019: 324
2020: 414
2021: 489
2022: 445
2023: 524
2024: 469
2025: 386
2026: 77
Papers
Mixture of Attention Heads: Selecting Attention Heads Per Token
EMNLP 2022
Improving Stability of Fine-Tuning Pretrained Language Models via Component-Wise Gradient Norm Clipping
EMNLP 2022
LittleBird: Efficient Faster & Longer Transformer for Question Answering
EMNLP 2022
R-TeaFor: Regularized Teacher-Forcing for Abstractive Summarization
EMNLP 2022
The Devil in Linear Transformer
EMNLP 2022
Efficient Pre-training of Masked Language Model via Concept-based Curriculum Masking
EMNLP 2022
STGN: an Implicit Regularization Method for Learning with Noisy Labels in Natural Language Processing
EMNLP 2022
Efficient Adversarial Training with Robust Early-Bird Tickets
EMNLP 2022
Evaluating the Impact of Model Scale for Compositional Generalization in Semantic Parsing
EMNLP 2022
Meta-Learning Fast Weight Language Models
EMNLP 2022
Fine-Tuning Pre-trained Transformers into Decaying Fast Weights
EMNLP 2022
Sampling-Based Approximations to Minimum Bayes Risk Decoding for Neural Machine Translation
EMNLP 2022
XPrompt: Exploring the Extreme of Prompt Tuning
EMNLP 2022
Knowledge Distillation based Contextual Relevance Matching for E-commerce Product Search
EMNLP 2022
Late Prompt Tuning: A Late Prompt Could Be Better Than Many Prompts
EMNLP 2022
Improving Sharpness-Aware Minimization with Fisher Mask for Better Generalization on Language Models
EMNLP 2022
Improving Generalization of Pre-trained Language Models via Stochastic Weight Averaging
EMNLP 2022
Sharpness-Aware Minimization with Dynamic Reweighting
EMNLP 2022
Search to Pass Messages for Temporal Knowledge Graph Completion
EMNLP 2022
FPT: Improving Prompt Tuning Efficiency via Progressive Training
EMNLP 2022
Scaling Laws Under the Microscope: Predicting Transformer Performance from Small Scale Experiments
EMNLP 2022
A Minimal Model for Compositional Generalization on gSCAN
EMNLP 2022
Using Deep Mixture-of-Experts to Detect Word Meaning Shift for TempoWiC
EMNLP 2022
Exploring Robustness of Prefix Tuning in Noisy Data: A Case Study in Financial Sentiment Analysis
EMNLP 2022
ANVITA-African: A Multilingual Neural Machine Translation System for African Languages
EMNLP 2022