← Application Areas

Machine Learning › Application Areas ›

Model Compression

1503 directly classified papers

Papers per year

Papers

LLM in a flash: Efficient Large Language Model Inference with Limited Memory ACL 2024

AlterMOMA: Fusion Redundancy Pruning for Camera-LiDAR Fusion Models with Alternative Modality Masking NIPS 2024

Pipeline Parallelism with Controllable Memory NIPS 2024

Is Modularity Transferable? A Case Study through the Lens of Knowledge Distillation COLING 2024

CLIP-KD: An Empirical Study of CLIP Model Distillation CVPR 2024

Exploiting Activation Sparsity with Dense to Dynamic-k Mixture-of-Experts Conversion NIPS 2024

Hollowed Net for On-Device Personalization of Text-to-Image Diffusion Models NIPS 2024

Slicing Vision Transformer for Flexible Inference NIPS 2024

Unveiling and Mitigating Backdoor Vulnerabilities based on Unlearning Weight Changes and Backdoor Activeness NIPS 2024

MEFT: Memory-Efficient Fine-Tuning through Sparse Adapter ACL 2024

Pick-or-Mix: Dynamic Channel Sampling for ConvNets CVPR 2024

The Ups and Downs of Large Language Model Inference with Vocabulary Trimming by Language Heuristics NAACL 2024

Simple and Fast Distillation of Diffusion Models NIPS 2024

HydraViT: Stacking Heads for a Scalable ViT NIPS 2024

MediSwift: Efficient Sparse Pre-trained Biomedical Language Models ACL 2024

BiPFT: Binary Pre-trained Foundation Transformer with Low-Rank Estimation of Binarization Residual Polynomials AAAI 2024

BiDM: Pushing the Limit of Quantization for Diffusion Models NIPS 2024

Enhancing In-Context Learning Performance with just SVD-Based Weight Pruning: A Theoretical Perspective NIPS 2024

ALPS: Improved Optimization for Highly Sparse One-Shot Pruning for Large Language Models NIPS 2024

Sparse maximal update parameterization: A holistic approach to sparse training dynamics NIPS 2024

Leveraging Normalization Layer in Adapters with Progressive Learning and Adaptive Distillation for Cross-Domain Few-Shot Learning AAAI 2024

S-STE: Continuous Pruning Function for Efficient 2:4 Sparse Pre-training NIPS 2024

Adaptive Depth Networks with Skippable Sub-Paths NIPS 2024

Exploring Domain Robust Lightweight Reward Models based on Router Mechanism ACL 2024

Linearly Decomposing and Recomposing Vision Transformers for Diverse-Scale Models NIPS 2024