Model Compression

1674 directly classified papers

Papers

BOLD: Boolean Logic Deep Learning NIPS 2024

VPTQ: Extreme Low-bit Vector Post-Training Quantization for Large Language Models EMNLP 2024

Breaking ReLU Barrier: Generalized MoEfication for Dense Pretrained Models EMNLP 2024

Mixture-of-Subspaces in Low-Rank Adaptation EMNLP 2024

LoRA-Guard: Parameter-Efficient Guardrail Adaptation for Content Moderation of Large Language Models EMNLP 2024

Multilinear Mixture of Experts: Scalable Expert Specialization through Factorization NIPS 2024

Taylor Unswift: Secured Weight Release for Large Language Models via Taylor Expansion EMNLP 2024

Layer Attack Unlearning: Fast and Accurate Machine Unlearning via Layer Level Attack and Knowledge Distillation AAAI 2024

Provable Robustness against a Union of L_0 Adversarial Attacks AAAI 2024

EAGLE-2: Faster Inference of Language Models with Dynamic Draft Trees EMNLP 2024

HEPrune: Fast Private Training of Deep Neural Networks With Encrypted Data Pruning NIPS 2024

StepbaQ: Stepping backward as Correction for Quantized Diffusion Models NIPS 2024

PTQ4DiT: Post-training Quantization for Diffusion Transformers NIPS 2024

Efficient Multi-task LLM Quantization and Serving for Multiple LoRA Adapters NIPS 2024

Find the Lady: Permutation and Re-synchronization of Deep Neural Networks AAAI 2024

ShareBERT: Embeddings Are Capable of Learning Hidden Layers AAAI 2024

CIDR: A Cooperative Integrated Dynamic Refining Method for Minimal Feature Removal Problem AAAI 2024

Memory-Efficient Reversible Spiking Neural Networks AAAI 2024

PTMQ: Post-training Multi-Bit Quantization of Neural Networks AAAI 2024

BiPFT: Binary Pre-trained Foundation Transformer with Low-Rank Estimation of Binarization Residual Polynomials AAAI 2024

Test-Time Domain Adaptation by Learning Domain-Aware Batch Normalization AAAI 2024

BVT-IMA: Binary Vision Transformer with Information-Modified Attention AAAI 2024

Towards Efficient Verification of Quantized Neural Networks AAAI 2024

AQ-DETR: Low-Bit Quantized Detection Transformer with Auxiliary Queries AAAI 2024

Building Variable-Sized Models via Learngene Pool AAAI 2024