Model Compression

1674 directly classified papers

Papers

Pyramid-BERT: Reducing Complexity via Successive Core-set based Token Selection ACL 2022

DQ-BART: Efficient Sequence-to-Sequence Model via Joint Distillation and Quantization ACL 2022

MoEfication: Transformer Feed-forward Layers are Mixtures of Experts ACL 2022

TextFusion: Privacy-Preserving Pre-trained Model Inference via Token Fusion EMNLP 2022

Intriguing Properties of Compression on Multilingual Models EMNLP 2022

Evaluating the Impact of Model Scale for Compositional Generalization in Semantic Parsing EMNLP 2022

EdgeFormer: A Parameter-Efficient Transformer for On-Device Seq2seq Generation EMNLP 2022

XPrompt: Exploring the Extreme of Prompt Tuning EMNLP 2022

Modular and Parameter-Efficient Fine-Tuning for NLP Models EMNLP 2022

BMCook: A Task-agnostic Compression Toolkit for Big Models EMNLP 2022

Developing Prefix-Tuning Models for Hierarchical Text Classification EMNLP 2022

Fast Vocabulary Transfer for Language Model Compression EMNLP 2022

XDoc: Unified Pre-training for Cross-Format Document Understanding EMNLP 2022

Using Deep Mixture-of-Experts to Detect Word Meaning Shift for TempoWiC EMNLP 2022

Control Prefixes for Parameter-Efficient Text Generation EMNLP 2022

Legal-Tech Open Diaries: Lesson learned on how to develop and deploy light-weight models in the era of humongous Language Models EMNLP 2022

Parameter-Efficient Legal Domain Adaptation EMNLP 2022

Efficient Two-Stage Progressive Quantization of BERT EMNLP 2022

Who Says Elephants Can’t Run: Bringing Large Scale MoE Models into Cloud Scale Production EMNLP 2022

Edinburgh’s Submission to the WMT 2022 Efficiency Task EMNLP 2022

Too Brittle to Touch: Comparing the Stability of Quantization and Distillation towards Developing Low-Resource MT Models EMNLP 2022

PrivateSNN: Privacy-Preserving Spiking Neural Networks AAAI 2022

Elastic-Link for Binarized Neural Networks AAAI 2022

COST-EFF: Collaborative Optimization of Spatial and Temporal Efficiency with Slenderized Multi-exit Language Models EMNLP 2022

Tutoring Helps Students Learn Better: Improving Knowledge Distillation for BERT with Tutor Network EMNLP 2022