Research Explorer
Papers
Trends
Conferences
Explore
Authors
Topics
Keywords
Papers
Trends
Conferences
Explore
Authors
Topics
Keywords
Achievements
About
Methodology
← Application Areas
Machine Learning
›
Application Areas
›
Model Compression
1503 directly classified papers
Papers per year
2006: 2
2010: 2
2011: 1
2013: 5
2014: 3
2015: 4
2016: 3
2017: 14
2018: 36
2019: 55
2020: 117
2021: 171
2022: 172
2023: 175
2024: 331
2025: 402
2026: 10
Papers
Reasons and Solutions for the Decline in Model Performance after Editing
NIPS 2024
Group and Shuffle: Efficient Structured Orthogonal Parametrization
NIPS 2024
FALCON: FLOP-Aware Combinatorial Optimization for Neural Network Pruning
AISTATS 2024
Robustness-Guided Image Synthesis for Data-Free Quantization
AAAI 2024
AdapterGNN: Parameter-Efficient Fine-Tuning Improves Generalization in GNNs
AAAI 2024
Distilling Autoregressive Models to Obtain High-Performance Non-autoregressive Solvers for Vehicle Routing Problems with Faster Inference Speed
AAAI 2024
ZipCache: Accurate and Efficient KV Cache Quantization with Salient Token Identification
NIPS 2024
FM-Delta: Lossless Compression for Storing Massive Fine-tuned Foundation Models
NIPS 2024
Parsimony or Capability? Decomposition Delivers Both in Long-term Time Series Forecasting
NIPS 2024
Federated Model Heterogeneous Matryoshka Representation Learning
NIPS 2024
Fluctuation-Based Adaptive Structured Pruning for Large Language Models
AAAI 2024
Wasserstein Distance Rivals Kullback-Leibler Divergence for Knowledge Distillation
NIPS 2024
Safe LoRA: The Silver Lining of Reducing Safety Risks when Finetuning Large Language Models
NIPS 2024
ScaleKD: Strong Vision Transformers Could Be Excellent Teachers
NIPS 2024
PTQ4DiT: Post-training Quantization for Diffusion Transformers
NIPS 2024
Induced Model Matching: Restricted Models Help Train Full-Featured Models
NIPS 2024
BOLD: Boolean Logic Deep Learning
NIPS 2024
ShareBERT: Embeddings Are Capable of Learning Hidden Layers
AAAI 2024
On Sampling Strategies for Spectral Model Sharding
NIPS 2024
Cherry on Top: Parameter Heterogeneity and Quantization in Large Language Models
NIPS 2024
S$^{2}$FT: Efficient, Scalable and Generalizable LLM Fine-tuning by Structured Sparsity
NIPS 2024
QTIP: Quantization with Trellises and Incoherence Processing
NIPS 2024
LISA: Layerwise Importance Sampling for Memory-Efficient Large Language Model Fine-Tuning
NIPS 2024
WAGLE: Strategic Weight Attribution for Effective and Modular Unlearning in Large Language Models
NIPS 2024
LoRA-GA: Low-Rank Adaptation with Gradient Approximation
NIPS 2024
<
1
…
25
26
27
…
61
>