Model Compression

1674 directly classified papers

Papers

Quantized Can Still Be Calibrated: A Unified Framework to Calibration in Quantized Large Language Models ACL 2025

Low-Bit Quantization Favors Undertrained LLMs ACL 2025

Training-free LLM Merging for Multi-task Learning ACL 2025

Speed Without Sacrifice: Fine-Tuning Language Models with Medusa and Knowledge Distillation in Travel Applications ACL 2025

TaDA: Training-free recipe for Decoding with Adaptive KV Cache Compression and Mean-centering ACL 2025

One-for-All Pruning: A Universal Model for Customized Compression of Large Language Models ACL 2025

Revisiting LoRA through the Lens of Parameter Redundancy: Spectral Encoding Helps ACL 2025

MPQ-DM: Mixed Precision Quantization for Extremely Low Bit Diffusion Models AAAI 2025

Squeezed Attention: Accelerating Long Context Length LLM Inference ACL 2025

TCAQ-DM: Timestep-Channel Adaptive Quantization for Diffusion Models AAAI 2025

ESC: Erasing Space Concept for Knowledge Deletion CVPR 2025

ConCISE: Confidence-guided Compression in Step-by-step Efficient Reasoning EMNLP 2025

ZJUKLAB at SemEval-2025 Task 4: Unlearning via Model Merging ACL 2025

CodeArena: Evaluating and Aligning CodeLLMs on Human Preference EMNLP 2025

Extended Abstract: Probing-Guided Parameter-Efficient Fine-Tuning for Balancing Linguistic Adaptation and Safety in LLM-based Social Influence Systems ACL 2025

BitNet: 1-bit Pre-training for Large Language Models JMLR 2025

OAC: Output-adaptive Calibration for Accurate Post-training Quantization AAAI 2025

Towards Lossless Implicit Neural Representation via Bit Plane Decomposition CVPR 2025

BlueLM-V-3B: Algorithm and System Co-Design for Multimodal Large Language Models on Mobile Devices CVPR 2025

Slender-Mamba: Fully Quantized Mamba in 1.58 Bits From Head to Toe COLING 2025

VQ4DiT: Efficient Post-Training Vector Quantization for Diffusion Transformers AAAI 2025

MimiQ: Low-Bit Data-Free Quantization of Vision Transformers with Encouraging Inter-Head Attention Similarity AAAI 2025

DELT: A Simple Diversity-driven EarlyLate Training for Dataset Distillation CVPR 2025

Lightweight Safety Guardrails Using Fine-tuned BERT Embeddings COLING 2025

LeanK: Learnable K Cache Channel Pruning for Efficient Decoding EMNLP 2025