Artificial Intelligence › Core AI ›

Model Compression

1928 directly classified papers

Papers per year

Papers

Hypernetworks for Perspectivist Adaptation EMNLP 2025

CORAL: Learning Consistent Representations across Multi-step Training with Lighter Speculative Drafter ACL 2025

Recoverable Compression: A Multimodal Vision Token Recovery Mechanism Guided by Text Information AAAI 2025

BinarySelect to Improve Accessibility of Black-Box Attack Research COLING 2025

Hopscotch: Discovering and Skipping Redundancies in Language Models EMNLP 2025

Tied-LoRA: Enhancing parameter efficiency of LoRA with Weight Tying NAACL 2024

FedLFC: Towards Efficient Federated Multilingual Modeling with LoRA-based Language Family Clustering NAACL 2024

Extremely efficient online query encoding for dense retrieval NAACL 2024

ESPACE: Dimensionality Reduction of Activations for Model Compression NIPS 2024

PV-Tuning: Beyond Straight-Through Estimation for Extreme LLM Compression NIPS 2024

Edinburgh Clinical NLP at SemEval-2024 Task 2: Fine-tune your model unless you have access to GPT-4 NAACL 2024

Structured Pruning for Large Language Models Using Coupled Components Elimination and Minor Fine-tuning NAACL 2024

Shears: Unstructured Sparsity with Neural Low-rank Adapter Search NAACL 2024

Lossless Acceleration of Large Language Model via Adaptive N-gram Parallel Decoding NAACL 2024

RedCoast: A Lightweight Tool to Automate Distributed Training of LLMs on Any GPU/TPUs NAACL 2024

Advancing the Robustness of Large Language Models through Self-Denoised Smoothing NAACL 2024

VeriCompress: A Tool to Streamline the Synthesis of Verified Robust Compressed Neural Networks from Scratch AAAI 2024

Efficient End-to-End Visual Document Understanding with Rationale Distillation NAACL 2024

Blind-Touch: Homomorphic Encryption-Based Distributed Neural Network Inference for Privacy-Preserving Fingerprint Authentication AAAI 2024

Revisiting the Information Capacity of Neural Network Watermarks: Upper Bound Estimation and Beyond AAAI 2024

ShareBERT: Embeddings Are Capable of Learning Hidden Layers AAAI 2024

OWQ: Outlier-Aware Weight Quantization for Efficient Fine-Tuning and Inference of Large Language Models AAAI 2024

BitsFusion: 1.99 bits Weight Quantization of Diffusion Model NIPS 2024

Building on Efficient Foundations: Effective Training of LLMs with Structured Feedforward Layers NIPS 2024

RoCoIns: Enhancing Robustness of Large Language Models through Code-Style Instructions COLING 2024