Research Explorer
Model Compression
1503 directly classified papers
Papers per year
2006: 2
2010: 2
2011: 1
2013: 5
2014: 3
2015: 4
2016: 3
2017: 14
2018: 36
2019: 55
2020: 117
2021: 171
2022: 172
2023: 175
2024: 331
2025: 402
2026: 10
Papers
RoseLoRA: Row and Column-wise Sparse Low-rank Adaptation of Pre-trained Language Model for Knowledge Editing and Fine-tuning
EMNLP 2024
Let the Expert Stick to His Last: Expert-Specialized Fine-Tuning for Sparse Architectural Large Language Models
EMNLP 2024
Parameter-Efficient Sparsity Crafting from Dense to Mixture-of-Experts for Instruction Tuning on General Tasks
EMNLP 2024
Crayon: Customized On-Device LLM via Instant Adapter Blending and Edge-Server Hybrid Inference
ACL 2024
WRP: Weight Recover Prune for Structured Sparsity
ACL 2024
Learning To Compose SuperWeights for Neural Parameter Allocation Search
WACV 2024
MediSwift: Efficient Sparse Pre-trained Biomedical Language Models
ACL 2024
QTIP: Quantization with Trellises and Incoherence Processing
NIPS 2024
LISA: Layerwise Importance Sampling for Memory-Efficient Large Language Model Fine-Tuning
NIPS 2024
Revisiting Neural Networks for Continual Learning: An Architectural Perspective
IJCAI 2024
Partial Binarization of Neural Networks for Budget-Aware Efficient Learning
WACV 2024
BiPFT: Binary Pre-trained Foundation Transformer with Low-Rank Estimation of Binarization Residual Polynomials
AAAI 2024
BVT-IMA: Binary Vision Transformer with Information-Modified Attention
AAAI 2024
AQ-DETR: Low-Bit Quantized Detection Transformer with Auxiliary Queries
AAAI 2024
Real-Time User-Guided Adaptive Colorization With Vision Transformer
WACV 2024
Building Variable-Sized Models via Learngene Pool
AAAI 2024
REPrune: Channel Pruning via Kernel Representative Selection
AAAI 2024
One-Step Forward and Backtrack: Overcoming Zig-Zagging in Loss-Aware Quantization Training
AAAI 2024
UniADS: Universal Architecture-Distiller Search for Distillation Gap
AAAI 2024
Token Fusion: Bridging the Gap Between Token Pruning and Token Merging
WACV 2024
Efficient Multi-task LLM Quantization and Serving for Multiple LoRA Adapters
NIPS 2024
Exploring Domain Robust Lightweight Reward Models based on Router Mechanism
ACL 2024
Backdoor Attacks via Machine Unlearning
AAAI 2024
UPDP: A Unified Progressive Depth Pruner for CNN and Vision Transformer
AAAI 2024
AdapterGNN: Parameter-Efficient Fine-Tuning Improves Generalization in GNNs
AAAI 2024