Research Explorer
Model Compression
1674 directly classified papers
Papers per year
2012: 1
2013: 2
2014: 2
2015: 7
2016: 9
2017: 27
2018: 51
2019: 79
2020: 189
2021: 165
2022: 206
2023: 207
2024: 325
2025: 399
2026: 5
Papers
Removing Batch Normalization Boosts Adversarial Training
ICML 2022
Optimal Clipping and Magnitude-aware Differentiation for Improved Quantization-aware Training
ICML 2022
AdapLeR: Speeding up Inference by Adaptive Length Reduction
ACL 2022
Attention Temperature Matters in Abstractive Summarization Distillation
ACL 2022
Sparse Progressive Distillation: Resolving Overfitting under Pretrain-and-Finetune Paradigm
ACL 2022
Multi-Granularity Structural Knowledge Distillation for Language Model Compression
ACL 2022
FrugalScore: Learning Cheaper, Lighter and Faster Evaluation Metrics for Automatic Text Generation
ACL 2022
Composable Sparse Fine-Tuning for Cross-Lingual Transfer
ACL 2022
Probing Structured Pruning on Multilingual Pre-trained Models: Settings, Algorithms, and Efficiency
ACL 2022
bert2BERT: Towards Reusable Pretrained Language Models
ACL 2022
Robust Lottery Tickets for Pre-trained Language Models
ACL 2022
Token Dropping for Efficient BERT Pretraining
ACL 2022
Compression of Generative Pre-trained Language Models via Quantization
ACL 2022
E-LANG: Energy-Based Joint Inferencing of Super and Swift Language Models
ACL 2022
SDR: Efficient Neural Re-ranking using Succinct Document Representation
ACL 2022
BitFit: Simple Parameter-efficient Fine-tuning for Transformer-based Masked Language-models
ACL 2022
TextPruner: A Model Pruning Toolkit for Pre-Trained Language Models
ACL 2022
BMInf: An Efficient Toolkit for Big Model Inference and Tuning
ACL 2022
Finding the Dominant Winning Ticket in Pre-Trained Language Models
ACL 2022
Aligned Weight Regularizers for Pruning Pretrained Neural Networks
ACL 2022
Diverse Lottery Tickets Boost Ensemble from a Single Pretrained Model
ACL 2022
Parameter-Efficient Abstractive Question Answering over Tables or Text
ACL 2022
Knowledge Base Index Compression via Dimensionality and Precision Reduction
ACL 2022
Universality of Winning Tickets: A Renormalization Group Perspective
ICML 2022
DeepSpeed-MoE: Advancing Mixture-of-Experts Inference and Training to Power Next-Generation AI Scale
ICML 2022