Neural Network Optimization

3,648 papers

Papers

HyPe: Better Pre-trained Language Model Fine-tuning with Hidden Representation Perturbation ACL 2023

CAME: Confidence-guided Adaptive Memory Efficient Optimization ACL 2023

Measuring the Instability of Fine-Tuning ACL 2023

Learning Better Masking for Better Language Model Pre-training ACL 2023

Laziness Is a Virtue When It Comes to Compositionality in Neural Semantic Parsing ACL 2023

AD-KD: Attribution-Driven Knowledge Distillation for Language Model Compression ACL 2023

HiFi: High-Information Attention Heads Hold for Parameter-Efficient Model Adaptation ACL 2023

MixCE: Training Autoregressive Language Models by Mixing Forward and Reverse Cross-Entropies ACL 2023

Interpreting Positional Information in Perspective of Word Order ACL 2023

Revisiting Token Dropping Strategy in Efficient BERT Pretraining ACL 2023

Rethinking the Role of Scale for In-Context Learning: An Interpretability-based Case Study at 66 Billion Scale ACL 2023

Hard Sample Aware Prompt-Tuning ACL 2023

Finding the Pillars of Strength for Multi-Head Attention ACL 2023

Decoder Tuning: Efficient Language Understanding as Decoding ACL 2023

Two-Stage Fine-Tuning for Improved Bias and Variance for Large Pretrained Language Models ACL 2023

A Natural Bias for Language Generation Models ACL 2023

Focused Prefix Tuning for Controllable Text Generation ACL 2023

Prefix Propagation: Parameter-Efficient Tuning for Long Sequences ACL 2023

When to Use Efficient Self Attention? Profiling Text, Speech and Image Transformer Variants ACL 2023

Japanese-to-English Simultaneous Dubbing Prototype ACL 2023

LECO: Improving Early Exiting via Learned Exits and Comparison-based Exiting Mechanism ACL 2023

Building Accurate Low Latency ASR for Streaming Voice Search in E-commerce ACL 2023

BADGE: Speeding Up BERT Inference after Deployment via Block-wise Bypasses and Divergence-based Early Exiting ACL 2023

Search Query Spell Correction with Weak Supervision in E-commerce ACL 2023

Overcoming Catastrophic Forgetting in Massively Multilingual Continual Learning ACL 2023