Research Explorer
Papers
Trends
Conferences
Explore
Authors
Topics
Keywords
Papers
Trends
Conferences
Explore
Authors
Topics
Keywords
Achievements
About
Methodology
← Optimization & Theory
Deep Learning
›
Optimization & Theory
›
Model Compression
1674 directly classified papers
Papers per year
2012: 1
2013: 2
2014: 2
2015: 7
2016: 9
2017: 27
2018: 51
2019: 79
2020: 189
2021: 165
2022: 206
2023: 207
2024: 325
2025: 399
2026: 5
Papers
Pyramid-BERT: Reducing Complexity via Successive Core-set based Token Selection
ACL 2022
DQ-BART: Efficient Sequence-to-Sequence Model via Joint Distillation and Quantization
ACL 2022
MoEfication: Transformer Feed-forward Layers are Mixtures of Experts
ACL 2022
TextFusion: Privacy-Preserving Pre-trained Model Inference via Token Fusion
EMNLP 2022
Intriguing Properties of Compression on Multilingual Models
EMNLP 2022
Evaluating the Impact of Model Scale for Compositional Generalization in Semantic Parsing
EMNLP 2022
EdgeFormer: A Parameter-Efficient Transformer for On-Device Seq2seq Generation
EMNLP 2022
XPrompt: Exploring the Extreme of Prompt Tuning
EMNLP 2022
Modular and Parameter-Efficient Fine-Tuning for NLP Models
EMNLP 2022
BMCook: A Task-agnostic Compression Toolkit for Big Models
EMNLP 2022
Developing Prefix-Tuning Models for Hierarchical Text Classification
EMNLP 2022
Fast Vocabulary Transfer for Language Model Compression
EMNLP 2022
XDoc: Unified Pre-training for Cross-Format Document Understanding
EMNLP 2022
Using Deep Mixture-of-Experts to Detect Word Meaning Shift for TempoWiC
EMNLP 2022
Control Prefixes for Parameter-Efficient Text Generation
EMNLP 2022
Legal-Tech Open Diaries: Lesson learned on how to develop and deploy light-weight models in the era of humongous Language Models
EMNLP 2022
Parameter-Efficient Legal Domain Adaptation
EMNLP 2022
Efficient Two-Stage Progressive Quantization of BERT
EMNLP 2022
Who Says Elephants Can’t Run: Bringing Large Scale MoE Models into Cloud Scale Production
EMNLP 2022
Edinburgh’s Submission to the WMT 2022 Efficiency Task
EMNLP 2022
Too Brittle to Touch: Comparing the Stability of Quantization and Distillation towards Developing Low-Resource MT Models
EMNLP 2022
PrivateSNN: Privacy-Preserving Spiking Neural Networks
AAAI 2022
Elastic-Link for Binarized Neural Networks
AAAI 2022
COST-EFF: Collaborative Optimization of Spatial and Temporal Efficiency with Slenderized Multi-exit Language Models
EMNLP 2022
Tutoring Helps Students Learn Better: Improving Knowledge Distillation for BERT with Tutor Network
EMNLP 2022
<
1
…
44
45
46
…
67
>