model compression

3283 papers

Explore in graph

Also known as

MC

Co-occurring keywords

knowledge distillation (3680) large language model (12755) neural network (6616) efficient computing (779) neural network optimization (1293) transfer learning (5442) convolutional neural network (4216) neural network pruning (265) language model (4573) parameter efficiency (415)

Papers

Stretching Each Dollar: Diffusion Training from Scratch on a Micro-Budget CVPR 2025

Task Singular Vectors: Reducing Task Interference in Model Merging CVPR 2025

SVD-LLM V2: Optimizing Singular Value Truncation for Large Language Model Compression NAACL 2025

APHQ-ViT: Post-Training Quantization with Average Perturbation Hessian Based Reconstruction for Vision Transformers CVPR 2025

Advancing Weight and Channel Sparsification with Enhanced Saliency WACV 2025

Difficulty Diversity and Plausibility: Dynamic Data-Free Quantization WACV 2025

Weight Copy and Low-Rank Adaptation for Few-Shot Distillation of Vision Transformers WACV 2025

InDistill: Information Flow-Preserving Knowledge Distillation for Model Compression WACV 2025

Patch Ranking: Token Pruning as Ranking Prediction for Efficient CLIP WACV 2025

LowFormer: Hardware Efficient Design for Convolutional Transformer Backbones WACV 2025

AAIG at GenAI Detection Task 1: Exploring Syntactically-Aware, Resource-Efficient Small Autoregressive Decoders for AI Content Detection COLING 2025

eLIR-Net: An Efficient AI Solution for Image Retouching WACV 2025

Extracting General-use Transformers for Low-resource Languages via Knowledge Distillation COLING 2025

When Every Token Counts: Optimal Segmentation for Low-Resource Language Models COLING 2025

ZigZagKV: Dynamic KV Cache Compression for Long-context Modeling based on Layer Uncertainty COLING 2025

Let’s Focus on Neuron: Neuron-Level Supervised Fine-tuning for Large Language Model COLING 2025

AMP-ViT: Optimizing Vision Transformer Efficiency with Adaptive Mixed-Precision Post-Training Quantization WACV 2025

Slender-Mamba: Fully Quantized Mamba in 1.58 Bits From Head to Toe COLING 2025

Revisiting Disparity from Dual-Pixel Images: Physics-Informed Lightweight Depth Estimation WACV 2025

Rethinking Kullback-Leibler Divergence in Knowledge Distillation for Large Language Models COLING 2025

Iterative Structured Knowledge Distillation: Optimizing Language Models Through Layer-by-Layer Distillation COLING 2025

DP-FROST: Differentially Private Fine-tuning of Pre-trained Models with Freezing Model Parameters COLING 2025

Enhancing One-Shot Pruned Pre-trained Language Models through Sparse-Dense-Sparse Mechanism COLING 2025

ProSparse: Introducing and Enhancing Intrinsic Activation Sparsity within Large Language Models COLING 2025

OptiPrune: Effective Pruning Approach for Every Target Sparsity COLING 2025