model compression

3283 papers

Also known as

MC

Co-occurring keywords

knowledge distillation (3680), large language model (12755), neural network (6616), efficient computing (779), neural network optimization (1293), transfer learning (5442), convolutional neural network (4216), neural network pruning (265), language model (4573), parameter efficiency (415)

Papers

A Novel Differentiable Mixed-Precision Quantization Search Framework for Alleviating the Matthew Effect and Improving Robustness ACML 2022

Information-Theoretic GAN Compression with Variational Energy-based Model NIPS 2022

Leveraging Inter-Layer Dependency for Post-Training Quantization NIPS 2022

BiMLP: Compact Binary Architectures for Vision Multi-Layer Perceptrons NIPS 2022

PokeBNN: A Binary Pursuit of Lightweight Accuracy CVPR 2022

Masking Adversarial Damage: Finding Adversarial Saliency for Robust and Sparse Network CVPR 2022

It's All in the Teacher: Zero-Shot Quantization Brought Closer to the Teacher CVPR 2022

AutoMTL: A Programming Framework for Automating Efficient Multi-Task Learning NIPS 2022

Partially-Random Initialization: A Smoking Gun for Binarization Hypothesis of BERT EMNLP 2022

Continuation KD: Improved Knowledge Distillation through the Lens of Continuation Optimization EMNLP 2022

Spartan: Differentiable Sparsity via Regularized Transportation NIPS 2022

Pruning’s Effect on Generalization Through the Lens of Training and Regularization NIPS 2022

JANUS: Joint Autoregressive and Non-autoregressive Training with Auxiliary Loss for Sequence Generation EMNLP 2022

Pseudo-Relevance for Enhancing Document Representation EMNLP 2022

Knowledge Distillation Transfer Sets and their Impact on Downstream NLU Tasks EMNLP 2022

Zero-Shot Dynamic Quantization for Transformer Inference EMNLP 2022

Efficient Knowledge Distillation from Model Checkpoints NIPS 2022

GPT3.int8(): 8-bit Matrix Multiplication for Transformers at Scale NIPS 2022

BATUDE: Budget-Aware Neural Network Compression Based on Tucker Decomposition AAAI 2022

Recall Distortion in Neural Network Pruning and the Undecayed Pruning Algorithm NIPS 2022

Knowledge Distillation from A Stronger Teacher NIPS 2022

Rethinking Resolution in the Context of Efficient Video Recognition NIPS 2022

FedSR: A Simple and Effective Domain Generalization Method for Federated Learning NIPS 2022

Localization Distillation for Dense Object Detection CVPR 2022

Attentive Fine-Grained Structured Sparsity for Image Restoration CVPR 2022