← Application Areas

Machine Learning › Application Areas ›

Model Compression

1503 directly classified papers

Papers per year

Papers

TASO: Task-Aligned Sparse Optimization for Parameter-Efficient Model Adaptation EMNLP 2025

Language Models Can be Efficiently Steered via Minimal Embedding Layer Transformations EMNLP 2025

Walk and Read Less: Improving the Efficiency of Vision-and-Language Navigation via Tuning-Free Multimodal Token Pruning EMNLP 2025

Balcony: A Lightweight Approach to Dynamic Inference of Generative Language Models EMNLP 2025

Profiler: Black-box AI-generated Text Origin Detection via Context-aware Inference Pattern Analysis EMNLP 2025

HELENE: Hessian Layer-wise Clipping and Gradient Annealing for Accelerating Fine-tuning LLM with Zeroth-order Optimization EMNLP 2025

SMEC:Rethinking Matryoshka Representation Learning for Retrieval Embedding Compression EMNLP 2025

GRASP: Replace Redundant Layers with Adaptive Singular Parameters for Efficient Model Compression EMNLP 2025

Studying the Role of Input-Neighbor Overlap in Retrieval-Augmented Language Models Training Efficiency EMNLP 2025

HydraOpt: Navigating the Efficiency-Performance Trade-off of Adapter Merging EMNLP 2025

COUNTDOWN: Contextually Sparse Activation Filtering Out Unnecessary Weights in Down Projection EMNLP 2025

CLMTracing: Black-box User-level Watermarking for Code Language Model Tracing EMNLP 2025

NAYER: Noisy Layer Data Generation for Efficient and Effective Data-free Knowledge Distillation CVPR 2024

Enhancing Post-training Quantization Calibration through Contrastive Learning CVPR 2024

CaKDP: Category-aware Knowledge Distillation and Pruning Framework for Lightweight 3D Object Detection CVPR 2024

Is Modularity Transferable? A Case Study through the Lens of Knowledge Distillation COLING 2024

Mixed-Precision Quantization for Federated Learning on Resource-Constrained Heterogeneous Devices CVPR 2024

Efficient AMR Parsing with CLAP: Compact Linearization with an Adaptable Parser COLING 2024

LLMR: Knowledge Distillation with a Large Language Model-Induced Reward COLING 2024

Shaving Weights with Occam's Razor: Bayesian Sparsification for Neural Networks using the Marginal Likelihood NIPS 2024

SIRIUS : Contexual Sparisty with Correction for Efficient LLMs NIPS 2024

On Giant's Shoulders: Effortless Weak to Strong by Dynamic Logits Fusion NIPS 2024

One More Step: A Versatile Plug-and-Play Module for Rectifying Diffusion Schedule Flaws and Enhancing Low-Frequency Controls CVPR 2024

ShiftAddLLM: Accelerating Pretrained LLMs via Post-Training Multiplication-Less Reparameterization NIPS 2024

mALBERT: Is a Compact Multilingual BERT Model Still Worth It? COLING 2024