← Optimization & Theory

Deep Learning › Optimization & Theory ›

Model Compression

1674 directly classified papers

Papers per year

Papers

Gemel: Model Merging for Memory-Efficient, Real-Time Video Analytics at the Edge NSDI 2023

Towards Robust Pruning: An Adaptive Knowledge-Retention Pruning Strategy for Language Models EMNLP 2023

LLM-FP4: 4-Bit Floating-Point Quantized Transformers EMNLP 2023

Compressing and Debiasing Vision-Language Pre-Trained Models for Visual Question Answering EMNLP 2023

I-ViT: Integer-only Quantization for Efficient Vision Transformer Inference ICCV 2023

Effectiveness of Data Augmentation for Parameter Efficient Tuning with Limited Data ACL 2023

Joint Token Pruning and Squeezing Towards More Aggressive Compression of Vision Transformers CVPR 2023

Machine Translation with Large Language Models: Prompting, Few-shot Learning, and Fine-tuning with QLoRA EMNLP 2023

MUX-PLMs: Pre-training Language Models with Data Multiplexing ACL 2023

EfficientVLM: Fast and Accurate Vision-Language Models via Knowledge Distillation and Modal-adaptive Pruning ACL 2023

Balanced Column-Wise Block Pruning for Maximizing GPU Parallelism AAAI 2023

ElasticViT: Conflict-aware Supernet Training for Deploying Fast Vision Transformer on Diverse Mobile Devices ICCV 2023

Structured Pruning for Efficient Generative Pre-trained Language Models ACL 2023

Modular Transformers: Compressing Transformers into Modularized Layers for Flexible Efficient Inference ACL 2023

LightFormer: Light-weight Transformer Using SVD-based Weight Transfer and Parameter Sharing ACL 2023

Are Intermediate Layers and Labels Really Necessary? A General Language Model Distillation Method ACL 2023

AutoMoE: Heterogeneous Mixture-of-Experts with Adaptive Computation for Efficient Neural Machine Translation ACL 2023

Neural Architecture Search for Parameter-Efficient Fine-tuning of Large Pre-trained Language Models ACL 2023

Distill or Annotate? Cost-Efficient Fine-Tuning of Compact Models ACL 2023

Rehearsal-free Continual Language Learning via Efficient Parameter Isolation ACL 2023

GreenKGC: A Lightweight Knowledge Graph Completion Method ACL 2023

Revisiting Token Dropping Strategy in Efficient BERT Pretraining ACL 2023

ESL-SNNs: An Evolutionary Structure Learning Strategy for Spiking Neural Networks AAAI 2023

CSTAR: Towards Compact and Structured Deep Neural Networks with Adversarial Robustness AAAI 2023

Can We Find Strong Lottery Tickets in Generative Models? AAAI 2023