Research Explorer
Papers
Trends
Conferences
Explore
Authors
Topics
Keywords
Papers
Trends
Conferences
Explore
Authors
Topics
Keywords
Achievements
About
Methodology
← Optimization & Theory
Deep Learning
›
Optimization & Theory
›
Model Compression
1674 directly classified papers
Papers per year
2012: 1
2013: 2
2014: 2
2015: 7
2016: 9
2017: 27
2018: 51
2019: 79
2020: 189
2021: 165
2022: 206
2023: 207
2024: 325
2025: 399
2026: 5
Papers
Overcoming Forgetting Catastrophe in Quantization-Aware Training
ICCV 2023
BiViT: Extremely Compressed Binary Vision Transformers
ICCV 2023
Dynamic Token Pruning in Plain Vision Transformers for Semantic Segmentation
ICCV 2023
TexQ: Zero-shot Network Quantization with Texture Feature Distribution Calibration
NIPS 2023
Polynomially Over-Parameterized Convolutional Neural Networks Contain Structured Strong Winning Lottery Tickets
NIPS 2023
ShiftAddViT: Mixture of Multiplication Primitives Towards Efficient Vision Transformer
NIPS 2023
Kronecker-Factored Approximate Curvature for Modern Neural Network Architectures
NIPS 2023
Binarized Spectral Compressive Imaging
NIPS 2023
The Emergence of Essential Sparsity in Large Pre-trained Models: The Weights that Matter
NIPS 2023
You Only Condense Once: Two Rules for Pruning Condensed Datasets
NIPS 2023
Understanding Neural Network Binarization with Forward and Backward Proximal Quantizers
NIPS 2023
Learning Re-sampling Methods with Parameter Attribution for Image Super-resolution
NIPS 2023
BiMatting: Efficient Video Matting via Binarization
NIPS 2023
Progressive Ensemble Distillation: Building Ensembles for Efficient Inference
NIPS 2023
Temporal Dynamic Quantization for Diffusion Models
NIPS 2023
SUBP: Soft Uniform Block Pruning for 1$\times$N Sparse CNNs Multithreading Acceleration
NIPS 2023
Optimal Parameter and Neuron Pruning for Out-of-Distribution Detection
NIPS 2023
Scissorhands: Exploiting the Persistence of Importance Hypothesis for LLM KV Cache Compression at Test Time
NIPS 2023
GOHSP: A Unified Framework of Graph and Optimization-Based Heterogeneous Structured Pruning for Vision Transformer
AAAI 2023
GQA: Training Generalized Multi-Query Transformer Models from Multi-Head Checkpoints
EMNLP 2023
Sparse Low-rank Adaptation of Pre-trained Language Models
EMNLP 2023
DiffFit: Unlocking Transferability of Large Diffusion Models via Simple Parameter-efficient Fine-Tuning
ICCV 2023
Shapley Head Pruning: Identifying and Removing Interference in Multilingual Transformers
EACL 2023
Teacher Intervention: Improving Convergence of Quantization Aware Training for Ultra-Low Precision Transformers
EACL 2023
Outlier Suppression+: Accurate quantization of large language models by equivalent and effective shifting and scaling
EMNLP 2023
<
1
…
34
35
36
…
67
>