Research Explorer
Papers
Trends
Conferences
Explore
Authors
Topics
Keywords
Papers
Trends
Conferences
Explore
Authors
Topics
Keywords
Achievements
About
Methodology
← Core AI
Artificial Intelligence
›
Core AI
›
Model Compression
1928 directly classified papers
Papers per year
2013: 2
2014: 1
2015: 6
2016: 4
2017: 13
2018: 47
2019: 81
2020: 114
2021: 172
2022: 191
2023: 272
2024: 370
2025: 489
2026: 166
Papers
SoLA: Leveraging Soft Activation Sparsity and Low-Rank Decomposition for Large Language Model Compression
AAAI 2025
BigMac: A Communication-Efficient Mixture-of-Experts Model Structure for Fast Training and Inference
AAAI 2025
Sample-aware Adaptive Structured Pruning for Large Language Models
AAAI 2025
RILQ: Rank-Insensitive LoRA-Based Quantization Error Compensation for Boosting 2-Bit Large Language Model Accuracy
AAAI 2025
Toward Adaptive Large Language Models Structured Pruning via Hybrid-grained Weight Importance Assessment
AAAI 2025
From PEFT to DEFT: Parameter Efficient Finetuning for Reducing Activation Density in Transformers
AAAI 2025
Pushing the Limits of BFP on Narrow Precision LLM Inference
AAAI 2025
MTL-LoRA: Low-Rank Adaptation for Multi-Task Learning
AAAI 2025
ScaleOT: Privacy-utility-scalable Offsite-tuning with Dynamic LayerReplace and Selective Rank Compression
AAAI 2025
Fit and Prune: Fast and Training-free Visual Token Pruning for Multi-modal Large Language Models
AAAI 2025
Treasures in Discarded Weights for LLM Quantization
AAAI 2025
ABQ-LLM: Arbitrary-Bit Quantized Inference Acceleration for Large Language Models
AAAI 2025
Channel Merging: Preserving Specialization for Merged Experts
AAAI 2025
ASER: Activation Smoothing and Error Reconstruction for Large Language Model Quantization
AAAI 2025
MeRino: Entropy-Driven Design for Generative Language Models on IoT Devices
AAAI 2025
Practical Offloading for Fine-Tuning LLM on Commodity GPU via Learned Sparse Projectors
AAAI 2025
Security Attacks on LLM-based Code Completion Tools
AAAI 2025
COSEE: Consistency-Oriented Signal-Based Early Exiting via Calibrated Sample Weighting Mechanism
AAAI 2025
AdaSkip: Adaptive Sublayer Skipping for Accelerating Long-Context LLM Inference
AAAI 2025
Gradient Weight-normalized Low-rank Projection for Efficient LLM Training
AAAI 2025
Pruning Large Language Models with Semi-Structural Adaptive Sparse Training
AAAI 2025
ScholarGEC: Enhancing Controllability of Large Language Model for Chinese Academic Grammatical Error Correction
AAAI 2025
Enhancing Large Language Model Performance with Gradient-Based Parameter Selection
AAAI 2025
Exploring Model Editing for LLM-based Aspect-Based Sentiment Classification
AAAI 2025
Self-calibration for Language Model Quantization and Pruning
NAACL 2025
<
1
…
15
16
17
…
78
>