Model Compression

1928 directly classified papers

Papers

SoLA: Leveraging Soft Activation Sparsity and Low-Rank Decomposition for Large Language Model Compression AAAI 2025

BigMac: A Communication-Efficient Mixture-of-Experts Model Structure for Fast Training and Inference AAAI 2025

Sample-aware Adaptive Structured Pruning for Large Language Models AAAI 2025

RILQ: Rank-Insensitive LoRA-Based Quantization Error Compensation for Boosting 2-Bit Large Language Model Accuracy AAAI 2025

Toward Adaptive Large Language Models Structured Pruning via Hybrid-grained Weight Importance Assessment AAAI 2025

From PEFT to DEFT: Parameter Efficient Finetuning for Reducing Activation Density in Transformers AAAI 2025

Pushing the Limits of BFP on Narrow Precision LLM Inference AAAI 2025

MTL-LoRA: Low-Rank Adaptation for Multi-Task Learning AAAI 2025

ScaleOT: Privacy-utility-scalable Offsite-tuning with Dynamic LayerReplace and Selective Rank Compression AAAI 2025

Fit and Prune: Fast and Training-free Visual Token Pruning for Multi-modal Large Language Models AAAI 2025

Treasures in Discarded Weights for LLM Quantization AAAI 2025

ABQ-LLM: Arbitrary-Bit Quantized Inference Acceleration for Large Language Models AAAI 2025

Channel Merging: Preserving Specialization for Merged Experts AAAI 2025

ASER: Activation Smoothing and Error Reconstruction for Large Language Model Quantization AAAI 2025

MeRino: Entropy-Driven Design for Generative Language Models on IoT Devices AAAI 2025

Practical Offloading for Fine-Tuning LLM on Commodity GPU via Learned Sparse Projectors AAAI 2025

Security Attacks on LLM-based Code Completion Tools AAAI 2025

COSEE: Consistency-Oriented Signal-Based Early Exiting via Calibrated Sample Weighting Mechanism AAAI 2025

AdaSkip: Adaptive Sublayer Skipping for Accelerating Long-Context LLM Inference AAAI 2025

Gradient Weight-normalized Low-rank Projection for Efficient LLM Training AAAI 2025

Pruning Large Language Models with Semi-Structural Adaptive Sparse Training AAAI 2025

ScholarGEC: Enhancing Controllability of Large Language Model for Chinese Academic Grammatical Error Correction AAAI 2025

Enhancing Large Language Model Performance with Gradient-Based Parameter Selection AAAI 2025

Exploring Model Editing for LLM-based Aspect-Based Sentiment Classification AAAI 2025

Self-calibration for Language Model Quantization and Pruning NAACL 2025