Research Explorer
Papers
Trends
Conferences
Explore
Authors
Topics
Keywords
Achievements
About
Methodology
Model Compression
1928 directly classified papers
Papers per year
2013: 2
2014: 1
2015: 6
2016: 4
2017: 13
2018: 47
2019: 81
2020: 114
2021: 172
2022: 191
2023: 272
2024: 370
2025: 489
2026: 166
Papers
HitKV: Activation Frequency Knows Which Tokens Are Important
AAAI 2026
SpecQuant: Spectral Decomposition and Adaptive Truncation for Ultra-Low-Bit LLMs Quantization
AAAI 2026
First-Order Error Matters: Accurate Compensation for Quantized Large Language Models
AAAI 2026
Share Your Attention: Transformer Weight Sharing via Matrix-based Dictionary Learning
AAAI 2026
GlitchCleaner: Lightweight Glitch Tokens Repairing by Lossless Gated LoRA in Large Language Models
AAAI 2026
DUP: Detection-guided Unlearning for Backdoor Purification in Language Models
AAAI 2026
Efficient Plug-and-Play Weight Refinement for Sparse Large Models
AAAI 2026
D2 Prune: Sparsifying Large Language Models via Dual Taylor Expansion and Attention Distribution Awareness
AAAI 2026
QuEPT: Quantized Elastic Precision Transformers with One-Shot Calibration for Multi-Bit Switching
AAAI 2026
CAMERA: Multi-Matrix Joint Compression for MoE Models via Micro-Expert Redundancy Analysis
AAAI 2026
Distillation-Guided Structural Transfer for Continual Learning Beyond Sparse Distributed Memory
AAAI 2026
Improving Generalization in LLM Structured Pruning via Function-Aware Neuron Grouping
AAAI 2026
Lethe: Layer- and Time-Adaptive KV Cache Pruning for Reasoning-Intensive LLM Serving
AAAI 2026
FP=XINT: Representing Neural Networks via Low-Bit Series Basis Functions
AAAI 2026
MultiKD: Backdoor Defense in Federated Graph Learning via Attention-Guided Multi-Teacher Distillation
AAAI 2026
MP-ISMoE: Mixed-Precision Interactive Side Mixture-of-Experts for Efficient Transfer Learning
AAAI 2026
FedSEA-LLaMA: A Secure, Efficient and Adaptive Federated Splitting Framework for Large Language Models
AAAI 2026
Neural-Augmented Kelvinlet for Real-Time Soft Tissue Deformation Modeling
AAAI 2026
DQT: Dynamic Quantization Training via Dequantization-Free Nested Integer Arithmetic
AAAI 2026
FlashSVD: Memory-Efficient Inference with Streaming for Low-Rank Models
AAAI 2026
MemeBQ: Memory Efficient Binary Quantization of LLMs
AAAI 2026
LLA: Enhancing Security and Privacy for Generative Models with Logic-Locked Accelerators
AAAI 2026
Learnable Permutation for Structured Sparsity on Transformer Models
AAAI 2026
On the Impact of Weight Quantization on Deep Neural Network Uncertainty
AAAI 2026
Balanced Knowledge Distillation for Large Language Models with Mix-of-Experts
AAAI 2026