Research Explorer
Papers
Trends
Conferences
Explore
Authors
Topics
Keywords
Papers
Trends
Conferences
Explore
Authors
Topics
Keywords
Achievements
About
Methodology
← Optimization & Theory
Deep Learning
›
Optimization & Theory
›
Model Compression
1674 directly classified papers
Papers per year
2012: 1
2013: 2
2014: 2
2015: 7
2016: 9
2017: 27
2018: 51
2019: 79
2020: 189
2021: 165
2022: 206
2023: 207
2024: 325
2025: 399
2026: 5
Papers
AlphaPruning: Using Heavy-Tailed Self Regularization Theory for Improved Layer-wise Pruning of Large Language Models
NIPS 2024
Layer-Adaptive State Pruning for Deep State Space Models
NIPS 2024
MoE Jetpack: From Dense Checkpoints to Adaptive Mixture of Experts for Vision Tasks
NIPS 2024
Shaving Weights with Occam's Razor: Bayesian Sparsification for Neural Networks using the Marginal Likelihood
NIPS 2024
Personalized Residuals for Concept-Driven Text-to-Image Generation
CVPR 2024
Heterogeneous LoRA for Federated Fine-tuning of On-Device Foundation Models
EMNLP 2024
GTP-ViT: Efficient Vision Transformers via Graph-Based Token Propagation
WACV 2024
Where Am I From? Identifying Origin of LLM-generated Content
EMNLP 2024
What Makes Quantization for Large Language Model Hard? An Empirical Study from the Lens of Perturbation
AAAI 2024
DEM: Distribution Edited Model for Training with Mixed Data Distributions
EMNLP 2024
BitDelta: Your Fine-Tune May Only Be Worth One Bit
NIPS 2024
Towards Better Structured Pruning Saliency by Reorganizing Convolution
WACV 2024
Torque Based Structured Pruning for Deep Neural Network
WACV 2024
LION: Implicit Vision Prompt Tuning
AAAI 2024
Efficient Reinforcement Learning by Discovering Neural Pathways
NIPS 2024
Unleashing Channel Potential: Space-Frequency Selection Convolution for SAR Object Detection
CVPR 2024
IRPruneDet: Efficient Infrared Small Target Detection via Wavelet Structure-Regularized Soft Channel Pruning
AAAI 2024
Progressive Distillation Based on Masked Generation Feature Method for Knowledge Graph Completion
AAAI 2024
Norm Tweaking: High-Performance Low-Bit Quantization of Large Language Models
AAAI 2024
Entropy Induced Pruning Framework for Convolutional Neural Networks
AAAI 2024
Breaking ReLU Barrier: Generalized MoEfication for Dense Pretrained Models
EMNLP 2024
Understanding the Role of the Projector in Knowledge Distillation
AAAI 2024
LoRA-Guard: Parameter-Efficient Guardrail Adaptation for Content Moderation of Large Language Models
EMNLP 2024
Wino Vidi Vici: Conquering Numerical Instability of 8-Bit Winograd Convolution for Accurate Inference Acceleration on Edge
WACV 2024
Expanding Sparse Tuning for Low Memory Usage
NIPS 2024
<
1
…
23
24
25
…
67
>