← Optimization & Theory

Deep Learning › Optimization & Theory ›

Model Compression

1674 directly classified papers

Papers per year

Papers

Overcoming Forgetting Catastrophe in Quantization-Aware Training ICCV 2023

BiViT: Extremely Compressed Binary Vision Transformers ICCV 2023

Dynamic Token Pruning in Plain Vision Transformers for Semantic Segmentation ICCV 2023

TexQ: Zero-shot Network Quantization with Texture Feature Distribution Calibration NIPS 2023

Polynomially Over-Parameterized Convolutional Neural Networks Contain Structured Strong Winning Lottery Tickets NIPS 2023

ShiftAddViT: Mixture of Multiplication Primitives Towards Efficient Vision Transformer NIPS 2023

Kronecker-Factored Approximate Curvature for Modern Neural Network Architectures NIPS 2023

Binarized Spectral Compressive Imaging NIPS 2023

The Emergence of Essential Sparsity in Large Pre-trained Models: The Weights that Matter NIPS 2023

You Only Condense Once: Two Rules for Pruning Condensed Datasets NIPS 2023

Understanding Neural Network Binarization with Forward and Backward Proximal Quantizers NIPS 2023

Learning Re-sampling Methods with Parameter Attribution for Image Super-resolution NIPS 2023

BiMatting: Efficient Video Matting via Binarization NIPS 2023

Progressive Ensemble Distillation: Building Ensembles for Efficient Inference NIPS 2023

Temporal Dynamic Quantization for Diffusion Models NIPS 2023

SUBP: Soft Uniform Block Pruning for 1$\times$N Sparse CNNs Multithreading Acceleration NIPS 2023

Optimal Parameter and Neuron Pruning for Out-of-Distribution Detection NIPS 2023

Scissorhands: Exploiting the Persistence of Importance Hypothesis for LLM KV Cache Compression at Test Time NIPS 2023

GOHSP: A Unified Framework of Graph and Optimization-Based Heterogeneous Structured Pruning for Vision Transformer AAAI 2023

GQA: Training Generalized Multi-Query Transformer Models from Multi-Head Checkpoints EMNLP 2023

Sparse Low-rank Adaptation of Pre-trained Language Models EMNLP 2023

DiffFit: Unlocking Transferability of Large Diffusion Models via Simple Parameter-efficient Fine-Tuning ICCV 2023

Shapley Head Pruning: Identifying and Removing Interference in Multilingual Transformers EACL 2023

Teacher Intervention: Improving Convergence of Quantization Aware Training for Ultra-Low Precision Transformers EACL 2023

Outlier Suppression+: Accurate quantization of large language models by equivalent and effective shifting and scaling EMNLP 2023