Research Explorer
Papers
Trends
Conferences
Explore
Authors
Topics
Keywords
Papers
Trends
Conferences
Explore
Authors
Topics
Keywords
Achievements
About
Methodology
← Optimization & Theory
Deep Learning
›
Optimization & Theory
›
Model Compression
1674 directly classified papers
Papers per year
2012: 1
2013: 2
2014: 2
2015: 7
2016: 9
2017: 27
2018: 51
2019: 79
2020: 189
2021: 165
2022: 206
2023: 207
2024: 325
2025: 399
2026: 5
Papers
Quantized Can Still Be Calibrated: A Unified Framework to Calibration in Quantized Large Language Models
ACL 2025
Low-Bit Quantization Favors Undertrained LLMs
ACL 2025
Training-free LLM Merging for Multi-task Learning
ACL 2025
Speed Without Sacrifice: Fine-Tuning Language Models with Medusa and Knowledge Distillation in Travel Applications
ACL 2025
TaDA: Training-free recipe for Decoding with Adaptive KV Cache Compression and Mean-centering
ACL 2025
One-for-All Pruning: A Universal Model for Customized Compression of Large Language Models
ACL 2025
Revisiting LoRA through the Lens of Parameter Redundancy: Spectral Encoding Helps
ACL 2025
MPQ-DM: Mixed Precision Quantization for Extremely Low Bit Diffusion Models
AAAI 2025
Squeezed Attention: Accelerating Long Context Length LLM Inference
ACL 2025
TCAQ-DM: Timestep-Channel Adaptive Quantization for Diffusion Models
AAAI 2025
ESC: Erasing Space Concept for Knowledge Deletion
CVPR 2025
ConCISE: Confidence-guided Compression in Step-by-step Efficient Reasoning
EMNLP 2025
ZJUKLAB at SemEval-2025 Task 4: Unlearning via Model Merging.
ACL 2025
CodeArena: Evaluating and Aligning CodeLLMs on Human Preference
EMNLP 2025
Extended Abstract: Probing-Guided Parameter-Efficient Fine-Tuning for Balancing Linguistic Adaptation and Safety in LLM-based Social Influence Systems
ACL 2025
BitNet: 1-bit Pre-training for Large Language Models
JMLR 2025
OAC: Output-adaptive Calibration for Accurate Post-training Quantization
AAAI 2025
Towards Lossless Implicit Neural Representation via Bit Plane Decomposition
CVPR 2025
BlueLM-V-3B: Algorithm and System Co-Design for Multimodal Large Language Models on Mobile Devices
CVPR 2025
Slender-Mamba: Fully Quantized Mamba in 1.58 Bits From Head to Toe
COLING 2025
VQ4DiT: Efficient Post-Training Vector Quantization for Diffusion Transformers
AAAI 2025
MimiQ: Low-Bit Data-Free Quantization of Vision Transformers with Encouraging Inter-Head Attention Similarity
AAAI 2025
DELT: A Simple Diversity-driven EarlyLate Training for Dataset Distillation
CVPR 2025
Lightweight Safety Guardrails Using Fine-tuned BERT Embeddings
COLING 2025
LeanK: Learnable K Cache Channel Pruning for Efficient Decoding
EMNLP 2025
<
1
…
15
16
17
…
67
>