Research Explorer
Papers
Trends
Conferences
Explore
Authors
Topics
Keywords
Papers
Trends
Conferences
Explore
Authors
Topics
Keywords
Achievements
About
Methodology
← Optimization & Theory
Deep Learning
›
Optimization & Theory
›
Model Compression
1674 directly classified papers
Papers per year
2012: 1
2013: 2
2014: 2
2015: 7
2016: 9
2017: 27
2018: 51
2019: 79
2020: 189
2021: 165
2022: 206
2023: 207
2024: 325
2025: 399
2026: 5
Papers
BOLD: Boolean Logic Deep Learning
NIPS 2024
VPTQ: Extreme Low-bit Vector Post-Training Quantization for Large Language Models
EMNLP 2024
Breaking ReLU Barrier: Generalized MoEfication for Dense Pretrained Models
EMNLP 2024
Mixture-of-Subspaces in Low-Rank Adaptation
EMNLP 2024
LoRA-Guard: Parameter-Efficient Guardrail Adaptation for Content Moderation of Large Language Models
EMNLP 2024
Multilinear Mixture of Experts: Scalable Expert Specialization through Factorization
NIPS 2024
Taylor Unswift: Secured Weight Release for Large Language Models via Taylor Expansion
EMNLP 2024
Layer Attack Unlearning: Fast and Accurate Machine Unlearning via Layer Level Attack and Knowledge Distillation
AAAI 2024
Provable Robustness against a Union of L_0 Adversarial Attacks
AAAI 2024
EAGLE-2: Faster Inference of Language Models with Dynamic Draft Trees
EMNLP 2024
HEPrune: Fast Private Training of Deep Neural Networks With Encrypted Data Pruning
NIPS 2024
StepbaQ: Stepping backward as Correction for Quantized Diffusion Models
NIPS 2024
PTQ4DiT: Post-training Quantization for Diffusion Transformers
NIPS 2024
Efficient Multi-task LLM Quantization and Serving for Multiple LoRA Adapters
NIPS 2024
Find the Lady: Permutation and Re-synchronization of Deep Neural Networks
AAAI 2024
ShareBERT: Embeddings Are Capable of Learning Hidden Layers
AAAI 2024
CIDR: A Cooperative Integrated Dynamic Refining Method for Minimal Feature Removal Problem
AAAI 2024
Memory-Efficient Reversible Spiking Neural Networks
AAAI 2024
PTMQ: Post-training Multi-Bit Quantization of Neural Networks
AAAI 2024
BiPFT: Binary Pre-trained Foundation Transformer with Low-Rank Estimation of Binarization Residual Polynomials
AAAI 2024
Test-Time Domain Adaptation by Learning Domain-Aware Batch Normalization
AAAI 2024
BVT-IMA: Binary Vision Transformer with Information-Modified Attention
AAAI 2024
Towards Efficient Verification of Quantized Neural Networks
AAAI 2024
AQ-DETR: Low-Bit Quantized Detection Transformer with Auxiliary Queries
AAAI 2024
Building Variable-Sized Models via Learngene Pool
AAAI 2024
<
1
…
17
18
19
…
67
>