model compression

3283 papers

Explore in graph

Also known as

MC

Co-occurring keywords

knowledge distillation (3680) large language model (12755) neural network (6616) efficient computing (779) neural network optimization (1293) transfer learning (5442) convolutional neural network (4216) neural network pruning (265) language model (4573) parameter efficiency (415)

Papers

Activation Map Compression through Tensor Decomposition for Deep Learning NIPS 2024

DDK: Distilling Domain Knowledge for Efficient Large Language Models NIPS 2024

SlimGPT: Layer-wise Structured Pruning for Large Language Models NIPS 2024

Adaptive Layer Sparsity for Large Language Models via Activation Correlation Assessment NIPS 2024

Q-VLM: Post-training Quantization for Large Vision-Language Models NIPS 2024

$\textit{Read-ME}$: Refactorizing LLMs as Router-Decoupled Mixture of Experts with System Co-Design NIPS 2024

PiSSA: Principal Singular Values and Singular Vectors Adaptation of Large Language Models NIPS 2024

Mixture of Scales: Memory-Efficient Token-Adaptive Binarization for Large Language Models NIPS 2024

Search for Efficient Large Language Models NIPS 2024

LightGaussian: Unbounded 3D Gaussian Compression with 15x Reduction and 200+ FPS NIPS 2024

Uncovering the Redundancy in Graph Self-supervised Learning Models NIPS 2024

The Iterative Optimal Brain Surgeon: Faster Sparse Recovery by Leveraging Second-Order Information NIPS 2024

Implicit Multimodal Alignment: On the Generalization of Frozen LLMs to Multimodal Inputs NIPS 2024

MiniCache: KV Cache Compression in Depth Dimension for Large Language Models NIPS 2024

Teaching Tiny Minds: Exploring Methods to Enhance Knowledge Distillation for Small Language Models CONLL 2024

Bit-Shrinking: Limiting Instantaneous Sharpness for Improving Post-Training Quantization CVPR 2023

Adaptive Data-Free Quantization CVPR 2023

Solving Oscillation Problem in Post-Training Quantization Through a Theoretical Perspective CVPR 2023

Learning To Retain While Acquiring: Combating Distribution-Shift in Adversarial Data-Free Knowledge Distillation CVPR 2023

Discriminator-Cooperated Feature Map Distillation for GAN Compression CVPR 2023

Efficient Transformer Knowledge Distillation: A Performance Review EMNLP 2023

IterDE: An Iterative Knowledge Distillation Framework for Knowledge Graph Embeddings AAAI 2023

Training Debiased Subnetworks With Contrastive Weight Pruning CVPR 2023

Compacting Binary Neural Networks by Sparse Kernel Selection CVPR 2023

Diffusion Probabilistic Model Made Slim CVPR 2023