Research Explorer
Papers
Trends
Conferences
Explore
Authors
Topics
Keywords
Papers
Trends
Conferences
Explore
Authors
Topics
Keywords
Achievements
About
Methodology
← Optimization & Theory
Deep Learning
›
Optimization & Theory
›
Model Compression
1674 directly classified papers
Papers per year
2012: 1
2013: 2
2014: 2
2015: 7
2016: 9
2017: 27
2018: 51
2019: 79
2020: 189
2021: 165
2022: 206
2023: 207
2024: 325
2025: 399
2026: 5
Papers
SequentialAttention++ for Block Sparsification: Differentiable Pruning Meets Combinatorial Optimization
NIPS 2024
IT-Tuning : Parameter Efficient Information Token Tuning for Language Model
ACL 2024
UWB at WASSA-2024 Shared Task 2: Cross-lingual Emotion Detection
ACL 2024
Personalized Residuals for Concept-Driven Text-to-Image Generation
CVPR 2024
GTP-ViT: Efficient Vision Transformers via Graph-Based Token Propagation
WACV 2024
Towards Better Structured Pruning Saliency by Reorganizing Convolution
WACV 2024
Torque Based Structured Pruning for Deep Neural Network
WACV 2024
LLM-QAT: Data-Free Quantization Aware Training for Large Language Models
ACL 2024
Unleashing Channel Potential: Space-Frequency Selection Convolution for SAR Object Detection
CVPR 2024
NeuroPrune: A Neuro-inspired Topological Sparse Training Algorithm for Large Language Models
ACL 2024
LM-Cocktail: Resilient Tuning of Language Models via Model Merging
ACL 2024
Disperse-Then-Merge: Pushing the Limits of Instruction Tuning via Alignment Tax Reduction
ACL 2024
Structured Unrestricted-Rank Matrices for Parameter Efficient Finetuning
NIPS 2024
IRPruneDet: Efficient Infrared Small Target Detection via Wavelet Structure-Regularized Soft Channel Pruning
AAAI 2024
Wino Vidi Vici: Conquering Numerical Instability of 8-Bit Winograd Convolution for Accurate Inference Acceleration on Edge
WACV 2024
QTIP: Quantization with Trellises and Incoherence Processing
NIPS 2024
UltraSparseBERT: 99% Conditionally Sparse Language Modelling
ACL 2024
Reducing the Side-Effects of Oscillations in Training of Quantized YOLO Networks
WACV 2024
ResLoRA: Identity Residual Mapping in Low-Rank Adaption
ACL 2024
DB-LLM: Accurate Dual-Binarization for Efficient LLMs
ACL 2024
BASS: Batched Attention-optimized Speculative Sampling
ACL 2024
Value-Based Deep Multi-Agent Reinforcement Learning with Dynamic Sparse Training
NIPS 2024
AFLoRA: Adaptive Freezing of Low Rank Adaptation in Parameter Efficient Fine-Tuning of Large Models
ACL 2024
IntactKV: Improving Large Language Model Quantization by Keeping Pivot Tokens Intact
ACL 2024
Light-PEFT: Lightening Parameter-Efficient Fine-Tuning via Early Pruning
ACL 2024
<
1
…
25
26
27
…
67
>