model compression

3283 papers

Also known as

MC

Co-occurring keywords

knowledge distillation (3680), large language model (12755), neural network (6616), efficient computing (779), neural network optimization (1293), transfer learning (5442), convolutional neural network (4216), neural network pruning (265), language model (4573), parameter efficiency (415)

Papers

Learning to Augment for Data-scarce Domain BERT Knowledge Distillation AAAI 2021

Faster Depth-Adaptive Transformers AAAI 2021

Continual Learning for Named Entity Recognition AAAI 2021

Accelerating Neural Machine Translation with Partial Word Embedding Compression AAAI 2021

Knowledge Distillation with Noisy Labels for Natural Language Understanding EMNLP 2021

Towards Compact CNNs via Collaborative Compression CVPR 2021

Wasserstein Contrastive Representation Distillation CVPR 2021

ProFormer: Towards On-Device LSH Projection Based Transformers EACL 2021

EarlyBERT: Efficient BERT Training via Early-bird Lottery Tickets IJCNLP 2021

Accelerate CNNs from Three Dimensions: A Comprehensive Pruning Framework ICML 2021

Accurate Post Training Quantization With Small Calibration Sets ICML 2021

A Unified Lottery Ticket Hypothesis for Graph Neural Networks ICML 2021

BinaryBERT: Pushing the Limit of BERT Quantization IJCNLP 2021

LeeBERT: Learned Early Exit for BERT with cross-level optimization IJCNLP 2021

Super Tickets in Pre-Trained Language Models: From Model Compression to Improving Generalization IJCNLP 2021

DPOQ: Dynamic Precision Onion Quantization ACML 2021

RGPNet: A Real-Time General Purpose Semantic Segmentation WACV 2021

HAWQ-V3: Dyadic Neural Network Quantization ICML 2021

Enabling Lightweight Fine-tuning for Pre-trained Language Model Compression based on Matrix Product Operators IJCNLP 2021

Multi-stage Pre-training over Simplified Multimodal Pre-training Models IJCNLP 2021

BatchQuant: Quantized-for-all Architecture Search with Robust Quantizer NeurIPS 2021

Weight Distillation: Transferring the Knowledge in Neural Network Parameters IJCNLP 2021

Marginal Utility Diminishes: Exploring the Minimum Knowledge for BERT Knowledge Distillation IJCNLP 2021

Structural Knowledge Distillation: Tractably Distilling Information for Structured Predictor IJCNLP 2021

Accelerating BERT Inference for Sequence Labeling via Early-Exit ACL 2021