Research Explorer
Papers
Trends
Conferences
Explore
Authors
Topics
Keywords
Papers
Trends
Conferences
Explore
Authors
Topics
Keywords
Achievements
About
Methodology
← Optimization & Theory
Deep Learning
›
Optimization & Theory
›
Model Compression
1674 directly classified papers
Papers per year
2012: 1
2013: 2
2014: 2
2015: 7
2016: 9
2017: 27
2018: 51
2019: 79
2020: 189
2021: 165
2022: 206
2023: 207
2024: 325
2025: 399
2026: 5
Papers
Gemel: Model Merging for Memory-Efficient, Real-Time Video Analytics at the Edge
NSDI 2023
Towards Robust Pruning: An Adaptive Knowledge-Retention Pruning Strategy for Language Models
EMNLP 2023
LLM-FP4: 4-Bit Floating-Point Quantized Transformers
EMNLP 2023
Compressing and Debiasing Vision-Language Pre-Trained Models for Visual Question Answering
EMNLP 2023
I-ViT: Integer-only Quantization for Efficient Vision Transformer Inference
ICCV 2023
Effectiveness of Data Augmentation for Parameter Efficient Tuning with Limited Data
ACL 2023
Joint Token Pruning and Squeezing Towards More Aggressive Compression of Vision Transformers
CVPR 2023
Machine Translation with Large Language Models: Prompting, Few-shot Learning, and Fine-tuning with QLoRA
EMNLP 2023
MUX-PLMs: Pre-training Language Models with Data Multiplexing
ACL 2023
EfficientVLM: Fast and Accurate Vision-Language Models via Knowledge Distillation and Modal-adaptive Pruning
ACL 2023
Balanced Column-Wise Block Pruning for Maximizing GPU Parallelism
AAAI 2023
ElasticViT: Conflict-aware Supernet Training for Deploying Fast Vision Transformer on Diverse Mobile Devices
ICCV 2023
Structured Pruning for Efficient Generative Pre-trained Language Models
ACL 2023
Modular Transformers: Compressing Transformers into Modularized Layers for Flexible Efficient Inference
ACL 2023
LightFormer: Light-weight Transformer Using SVD-based Weight Transfer and Parameter Sharing
ACL 2023
Are Intermediate Layers and Labels Really Necessary? A General Language Model Distillation Method
ACL 2023
AutoMoE: Heterogeneous Mixture-of-Experts with Adaptive Computation for Efficient Neural Machine Translation
ACL 2023
Neural Architecture Search for Parameter-Efficient Fine-tuning of Large Pre-trained Language Models
ACL 2023
Distill or Annotate? Cost-Efficient Fine-Tuning of Compact Models
ACL 2023
Rehearsal-free Continual Language Learning via Efficient Parameter Isolation
ACL 2023
GreenKGC: A Lightweight Knowledge Graph Completion Method
ACL 2023
Revisiting Token Dropping Strategy in Efficient BERT Pretraining
ACL 2023
ESL-SNNs: An Evolutionary Structure Learning Strategy for Spiking Neural Networks
AAAI 2023
CSTAR: Towards Compact and Structured Deep Neural Networks with Adversarial Robustness
AAAI 2023
Can We Find Strong Lottery Tickets in Generative Models?
AAAI 2023
<
1
…
35
36
37
…
67
>