model compression

3283 papers

Explore in graph

Also known as

MC

Co-occurring keywords

knowledge distillation (3680) large language model (12755) neural network (6616) efficient computing (779) neural network optimization (1293) transfer learning (5442) convolutional neural network (4216) neural network pruning (265) language model (4573) parameter efficiency (415)

Papers

Integrating Independent Layer-Wise Rank Selection with Low-Rank SVD Training for Model Compression: A Theory-Driven Approach IJCAI 2025

Scheduling Weight Transitions for Quantization-Aware Training ICCV 2025

Logic Distillation: Learning from Code Function by Function for Decision-making Tasks IJCAI 2025

From PEFT to DEFT: Parameter Efficient Finetuning for Reducing Activation Density in Transformers AAAI 2025

FBQuant: FeedBack Quantization for Large Language Models IJCAI 2025

GQSA: Group Quantization and Sparsity for Accelerating Large Language Model Inference IJCNLP 2025

Exploring the Trade-Offs: Quantization Methods, Task Difficulty, and Model Size in Large Language Models From Edge to Giant IJCAI 2025

FlexiGPT: Pruning and Extending Large Language Models with Low-Rank Weight Sharing NAACL 2025

Accurate Sublayer Pruning for Large Language Models by Exploiting Latency and Tunability Information IJCAI 2025

LiteASR: Efficient Automatic Speech Recognition with Low-Rank Approximation EMNLP 2025

C3oT: Generating Shorter Chain-of-Thought Without Compromising Effectiveness AAAI 2025

Scaffold-BPE: Enhancing Byte Pair Encoding for Large Language Models with Simple and Effective Scaffold Token Removal AAAI 2025

A Compact Model for Mathematics Problem Representations Distilled from BERT AAAI 2025

CSR:Achieving 1 Bit Key-Value Cache via Sparse Representation AAAI 2025

SWITCH: Studying with Teacher for Knowledge Distillation of Large Language Models NAACL 2025

Meta-Unlearning on Diffusion Models: Preventing Relearning Unlearned Concepts ICCV 2025

TR-PTS: Task-Relevant Parameter and Token Selection for Efficient Tuning ICCV 2025

QPruner: Probabilistic Decision Quantization for Structured Pruning in Large Language Models NAACL 2025

KDAT: Inherent Adversarial Robustness via Knowledge Distillation with Adversarial Tuning for Object Detection Models AAAI 2025

EA-KD: Entropy-based Adaptive Knowledge Distillation ICCV 2025

Distilling Knowledge from Heterogeneous Architectures for Semantic Segmentation AAAI 2025

An Efficient and Accurate Dynamic Sparse Training Framework Based on Parameter-Freezing AAAI 2025

Efficient Federated Learning via Clients-to-Server Knowledge Distillation (Student Abstract) AAAI 2025

Neural Collapse Inspired Knowledge Distillation AAAI 2025

Enhancing Low-Rank Adaptation with Recoverability-Based Reinforcement Pruning for Object Counting AAAI 2025