conftrace
_
Papers
Trends
Conferences
Explore
Authors
Papers
Trends
Conferences
Explore
Authors
Topics
Keywords
Achievements
← Optimization & Theory
Machine Learning
›
Optimization & Theory
›
Neural Network Optimization
3,648 papers
Papers per year
2001: 1
2003: 1
2005: 2
2006: 3
2007: 6
2008: 1
2009: 7
2010: 5
2011: 7
2012: 9
2013: 17
2014: 18
2015: 40
2016: 76
2017: 113
2018: 214
2019: 324
2020: 414
2021: 489
2022: 445
2023: 524
2024: 469
2025: 386
2026: 77
Papers
HyPe: Better Pre-trained Language Model Fine-tuning with Hidden Representation Perturbation
ACL 2023
CAME: Confidence-guided Adaptive Memory Efficient Optimization
ACL 2023
Measuring the Instability of Fine-Tuning
ACL 2023
Learning Better Masking for Better Language Model Pre-training
ACL 2023
Laziness Is a Virtue When It Comes to Compositionality in Neural Semantic Parsing
ACL 2023
AD-KD: Attribution-Driven Knowledge Distillation for Language Model Compression
ACL 2023
HiFi: High-Information Attention Heads Hold for Parameter-Efficient Model Adaptation
ACL 2023
MixCE: Training Autoregressive Language Models by Mixing Forward and Reverse Cross-Entropies
ACL 2023
Interpreting Positional Information in Perspective of Word Order
ACL 2023
Revisiting Token Dropping Strategy in Efficient BERT Pretraining
ACL 2023
Rethinking the Role of Scale for In-Context Learning: An Interpretability-based Case Study at 66 Billion Scale
ACL 2023
Hard Sample Aware Prompt-Tuning
ACL 2023
Finding the Pillars of Strength for Multi-Head Attention
ACL 2023
Decoder Tuning: Efficient Language Understanding as Decoding
ACL 2023
Two-Stage Fine-Tuning for Improved Bias and Variance for Large Pretrained Language Models
ACL 2023
A Natural Bias for Language Generation Models
ACL 2023
Focused Prefix Tuning for Controllable Text Generation
ACL 2023
Prefix Propagation: Parameter-Efficient Tuning for Long Sequences
ACL 2023
When to Use Efficient Self Attention? Profiling Text, Speech and Image Transformer Variants
ACL 2023
Japanese-to-English Simultaneous Dubbing Prototype
ACL 2023
LECO: Improving Early Exiting via Learned Exits and Comparison-based Exiting Mechanism
ACL 2023
Building Accurate Low Latency ASR for Streaming Voice Search in E-commerce
ACL 2023
BADGE: Speeding Up BERT Inference after Deployment via Block-wise Bypasses and Divergence-based Early Exiting
ACL 2023
Search Query Spell Correction with Weak Supervision in E-commerce
ACL 2023
Overcoming Catastrophic Forgetting in Massively Multilingual Continual Learning
ACL 2023
<
1
…
45
46
47
…
146
>