Research Explorer
model compression
3283 papers
Also known as
MC
Co-occurring keywords
knowledge distillation (3680)
large language model (12755)
neural network (6616)
efficient computing (779)
neural network optimization (1293)
transfer learning (5442)
convolutional neural network (4216)
neural network pruning (265)
language model (4573)
parameter efficiency (415)
Papers
A Novel Differentiable Mixed-Precision Quantization Search Framework for Alleviating the Matthew Effect and Improving Robustness
ACML 2022
Information-Theoretic GAN Compression with Variational Energy-based Model
NIPS 2022
Leveraging Inter-Layer Dependency for Post-Training Quantization
NIPS 2022
BiMLP: Compact Binary Architectures for Vision Multi-Layer Perceptrons
NIPS 2022
PokeBNN: A Binary Pursuit of Lightweight Accuracy
CVPR 2022
Masking Adversarial Damage: Finding Adversarial Saliency for Robust and Sparse Network
CVPR 2022
It's All in the Teacher: Zero-Shot Quantization Brought Closer to the Teacher
CVPR 2022
AutoMTL: A Programming Framework for Automating Efficient Multi-Task Learning
NIPS 2022
Partially-Random Initialization: A Smoking Gun for Binarization Hypothesis of BERT
EMNLP 2022
Continuation KD: Improved Knowledge Distillation through the Lens of Continuation Optimization
EMNLP 2022
Spartan: Differentiable Sparsity via Regularized Transportation
NIPS 2022
Pruning’s Effect on Generalization Through the Lens of Training and Regularization
NIPS 2022
JANUS: Joint Autoregressive and Non-autoregressive Training with Auxiliary Loss for Sequence Generation
EMNLP 2022
Pseudo-Relevance for Enhancing Document Representation
EMNLP 2022
Knowledge Distillation Transfer Sets and their Impact on Downstream NLU Tasks
EMNLP 2022
Zero-Shot Dynamic Quantization for Transformer Inference
EMNLP 2022
Efficient Knowledge Distillation from Model Checkpoints
NIPS 2022
GPT3.int8(): 8-bit Matrix Multiplication for Transformers at Scale
NIPS 2022
BATUDE: Budget-Aware Neural Network Compression Based on Tucker Decomposition
AAAI 2022
Recall Distortion in Neural Network Pruning and the Undecayed Pruning Algorithm
NIPS 2022
Knowledge Distillation from A Stronger Teacher
NIPS 2022
Rethinking Resolution in the Context of Efficient Video Recognition
NIPS 2022
FedSR: A Simple and Effective Domain Generalization Method for Federated Learning
NIPS 2022
Localization Distillation for Dense Object Detection
CVPR 2022
Attentive Fine-Grained Structured Sparsity for Image Restoration
CVPR 2022