Research Explorer
Papers
Trends
Conferences
Explore
Authors
Topics
Keywords
Papers
Trends
Conferences
Explore
Authors
Topics
Keywords
Achievements
About
Methodology
← Application Areas
Machine Learning
›
Application Areas
›
Model Compression
1503 directly classified papers
Papers per year
2006: 2
2010: 2
2011: 1
2013: 5
2014: 3
2015: 4
2016: 3
2017: 14
2018: 36
2019: 55
2020: 117
2021: 171
2022: 172
2023: 175
2024: 331
2025: 402
2026: 10
Papers
TASO: Task-Aligned Sparse Optimization for Parameter-Efficient Model Adaptation
EMNLP 2025
Language Models Can be Efficiently Steered via Minimal Embedding Layer Transformations
EMNLP 2025
Walk and Read Less: Improving the Efficiency of Vision-and-Language Navigation via Tuning-Free Multimodal Token Pruning
EMNLP 2025
Balcony: A Lightweight Approach to Dynamic Inference of Generative Language Models
EMNLP 2025
Profiler: Black-box AI-generated Text Origin Detection via Context-aware Inference Pattern Analysis
EMNLP 2025
HELENE: Hessian Layer-wise Clipping and Gradient Annealing for Accelerating Fine-tuning LLM with Zeroth-order Optimization
EMNLP 2025
SMEC:Rethinking Matryoshka Representation Learning for Retrieval Embedding Compression
EMNLP 2025
GRASP: Replace Redundant Layers with Adaptive Singular Parameters for Efficient Model Compression
EMNLP 2025
Studying the Role of Input-Neighbor Overlap in Retrieval-Augmented Language Models Training Efficiency
EMNLP 2025
HydraOpt: Navigating the Efficiency-Performance Trade-off of Adapter Merging
EMNLP 2025
COUNTDOWN: Contextually Sparse Activation Filtering Out Unnecessary Weights in Down Projection
EMNLP 2025
CLMTracing: Black-box User-level Watermarking for Code Language Model Tracing
EMNLP 2025
NAYER: Noisy Layer Data Generation for Efficient and Effective Data-free Knowledge Distillation
CVPR 2024
Enhancing Post-training Quantization Calibration through Contrastive Learning
CVPR 2024
CaKDP: Category-aware Knowledge Distillation and Pruning Framework for Lightweight 3D Object Detection
CVPR 2024
Is Modularity Transferable? A Case Study through the Lens of Knowledge Distillation
COLING 2024
Mixed-Precision Quantization for Federated Learning on Resource-Constrained Heterogeneous Devices
CVPR 2024
Efficient AMR Parsing with CLAP: Compact Linearization with an Adaptable Parser
COLING 2024
LLMR: Knowledge Distillation with a Large Language Model-Induced Reward
COLING 2024
Shaving Weights with Occam's Razor: Bayesian Sparsification for Neural Networks using the Marginal Likelihood
NIPS 2024
SIRIUS : Contexual Sparisty with Correction for Efficient LLMs
NIPS 2024
On Giant's Shoulders: Effortless Weak to Strong by Dynamic Logits Fusion
NIPS 2024
One More Step: A Versatile Plug-and-Play Module for Rectifying Diffusion Schedule Flaws and Enhancing Low-Frequency Controls
CVPR 2024
ShiftAddLLM: Accelerating Pretrained LLMs via Post-Training Multiplication-Less Reparameterization
NIPS 2024
mALBERT: Is a Compact Multilingual BERT Model Still Worth It?
COLING 2024
<
1
…
16
17
18
…
61
>