Research Explorer
Papers
Trends
Conferences
Explore
Authors
Topics
Keywords
Papers
Trends
Conferences
Explore
Authors
Topics
Keywords
Achievements
About
Methodology
← Core AI
Artificial Intelligence
›
Core AI
›
Model Compression
1928 directly classified papers
Papers per year
2013: 2
2014: 1
2015: 6
2016: 4
2017: 13
2018: 47
2019: 81
2020: 114
2021: 172
2022: 191
2023: 272
2024: 370
2025: 489
2026: 166
Papers
AlphaLoRA: Assigning LoRA Experts Based on Layer Training Quality
EMNLP 2024
ApiQ: Finetuning of 2-Bit Quantized Large Language Model
EMNLP 2024
BS-PLCNet 2: Two-stage Band-split Packet Loss Concealment Network with Intra-model Knowledge Distillation
INTERSPEECH 2024
Extremely efficient online query encoding for dense retrieval
NAACL 2024
Cherry on Top: Parameter Heterogeneity and Quantization in Large Language Models
NIPS 2024
Finding Lottery Tickets in Vision Models via Data-driven Spectral Foresight Pruning
CVPR 2024
Resource-Efficient Transformer Pruning for Finetuning of Large Models
CVPR 2024
Attention-Driven Training-Free Efficiency Enhancement of Diffusion Models
CVPR 2024
Memory-Efficient Fine-Tuning of Transformers via Token Selection
EMNLP 2024
MERGE: Fast Private Text Generation
AAAI 2024
UniPTS: A Unified Framework for Proficient Post-Training Sparsity
CVPR 2024
Fairness-Aware Structured Pruning in Transformers
AAAI 2024
All Rivers Run to the Sea: Private Learning with Asymmetric Flows
CVPR 2024
NYU CTF Bench: A Scalable Open-Source Benchmark Dataset for Evaluating LLMs in Offensive Security
NIPS 2024
Rebuilding ROME : Resolving Model Collapse during Sequential Model Editing
EMNLP 2024
DiffHammer: Rethinking the Robustness of Diffusion-Based Adversarial Purification
NIPS 2024
To Forget or Not? Towards Practical Knowledge Unlearning for Large Language Models
EMNLP 2024
Optimize Weight Rounding via Signed Gradient Descent for the Quantization of LLMs
EMNLP 2024
ATQ: Activation Transformation forWeight-Activation Quantization of Large Language Models
EMNLP 2024
BPQP: A Differentiable Convex Optimization Framework for Efficient End-to-End Learning
NIPS 2024
Structured Optimal Brain Pruning for Large Language Models
EMNLP 2024
SecCoder: Towards Generalizable and Robust Secure Code Generation
EMNLP 2024
ALPS: Improved Optimization for Highly Sparse One-Shot Pruning for Large Language Models
NIPS 2024
SparseGrad: A Selective Method for Efficient Fine-tuning of MLP Layers
EMNLP 2024
Mitigating the Language Mismatch and Repetition Issues in LLM-based Machine Translation via Model Editing
EMNLP 2024
<
1
…
34
35
36
…
78
>