Research Explorer
Model Compression
1503 directly classified papers
Papers per year
2006: 2
2010: 2
2011: 1
2013: 5
2014: 3
2015: 4
2016: 3
2017: 14
2018: 36
2019: 55
2020: 117
2021: 171
2022: 172
2023: 175
2024: 331
2025: 402
2026: 10
Papers
LLM in a flash: Efficient Large Language Model Inference with Limited Memory
ACL 2024
AlterMOMA: Fusion Redundancy Pruning for Camera-LiDAR Fusion Models with Alternative Modality Masking
NIPS 2024
Pipeline Parallelism with Controllable Memory
NIPS 2024
Is Modularity Transferable? A Case Study through the Lens of Knowledge Distillation
COLING 2024
CLIP-KD: An Empirical Study of CLIP Model Distillation
CVPR 2024
Exploiting Activation Sparsity with Dense to Dynamic-k Mixture-of-Experts Conversion
NIPS 2024
Hollowed Net for On-Device Personalization of Text-to-Image Diffusion Models
NIPS 2024
Slicing Vision Transformer for Flexible Inference
NIPS 2024
Unveiling and Mitigating Backdoor Vulnerabilities based on Unlearning Weight Changes and Backdoor Activeness
NIPS 2024
MEFT: Memory-Efficient Fine-Tuning through Sparse Adapter
ACL 2024
Pick-or-Mix: Dynamic Channel Sampling for ConvNets
CVPR 2024
The Ups and Downs of Large Language Model Inference with Vocabulary Trimming by Language Heuristics
NAACL 2024
Simple and Fast Distillation of Diffusion Models
NIPS 2024
HydraViT: Stacking Heads for a Scalable ViT
NIPS 2024
MediSwift: Efficient Sparse Pre-trained Biomedical Language Models
ACL 2024
BiPFT: Binary Pre-trained Foundation Transformer with Low-Rank Estimation of Binarization Residual Polynomials
AAAI 2024
BiDM: Pushing the Limit of Quantization for Diffusion Models
NIPS 2024
Enhancing In-Context Learning Performance with just SVD-Based Weight Pruning: A Theoretical Perspective
NIPS 2024
ALPS: Improved Optimization for Highly Sparse One-Shot Pruning for Large Language Models
NIPS 2024
Sparse maximal update parameterization: A holistic approach to sparse training dynamics
NIPS 2024
Leveraging Normalization Layer in Adapters with Progressive Learning and Adaptive Distillation for Cross-Domain Few-Shot Learning
AAAI 2024
S-STE: Continuous Pruning Function for Efficient 2:4 Sparse Pre-training
NIPS 2024
Adaptive Depth Networks with Skippable Sub-Paths
NIPS 2024
Exploring Domain Robust Lightweight Reward Models based on Router Mechanism
ACL 2024
Linearly Decomposing and Recomposing Vision Transformers for Diverse-Scale Models
NIPS 2024