Research Explorer
Model Compression
1928 directly classified papers
Papers per year
2013: 2
2014: 1
2015: 6
2016: 4
2017: 13
2018: 47
2019: 81
2020: 114
2021: 172
2022: 191
2023: 272
2024: 370
2025: 489
2026: 166
Papers
Hypernetworks for Perspectivist Adaptation
EMNLP 2025
CORAL: Learning Consistent Representations across Multi-step Training with Lighter Speculative Drafter
ACL 2025
Recoverable Compression: A Multimodal Vision Token Recovery Mechanism Guided by Text Information
AAAI 2025
BinarySelect to Improve Accessibility of Black-Box Attack Research
COLING 2025
Hopscotch: Discovering and Skipping Redundancies in Language Models
EMNLP 2025
Tied-LoRA: Enhancing parameter efficiency of LoRA with Weight Tying
NAACL 2024
FedLFC: Towards Efficient Federated Multilingual Modeling with LoRA-based Language Family Clustering
NAACL 2024
Extremely efficient online query encoding for dense retrieval
NAACL 2024
ESPACE: Dimensionality Reduction of Activations for Model Compression
NIPS 2024
PV-Tuning: Beyond Straight-Through Estimation for Extreme LLM Compression
NIPS 2024
Edinburgh Clinical NLP at SemEval-2024 Task 2: Fine-tune your model unless you have access to GPT-4
NAACL 2024
Structured Pruning for Large Language Models Using Coupled Components Elimination and Minor Fine-tuning
NAACL 2024
Shears: Unstructured Sparsity with Neural Low-rank Adapter Search
NAACL 2024
Lossless Acceleration of Large Language Model via Adaptive N-gram Parallel Decoding
NAACL 2024
RedCoast: A Lightweight Tool to Automate Distributed Training of LLMs on Any GPU/TPUs
NAACL 2024
Advancing the Robustness of Large Language Models through Self-Denoised Smoothing
NAACL 2024
VeriCompress: A Tool to Streamline the Synthesis of Verified Robust Compressed Neural Networks from Scratch
AAAI 2024
Efficient End-to-End Visual Document Understanding with Rationale Distillation
NAACL 2024
Blind-Touch: Homomorphic Encryption-Based Distributed Neural Network Inference for Privacy-Preserving Fingerprint Authentication
AAAI 2024
Revisiting the Information Capacity of Neural Network Watermarks: Upper Bound Estimation and Beyond
AAAI 2024
ShareBERT: Embeddings Are Capable of Learning Hidden Layers
AAAI 2024
OWQ: Outlier-Aware Weight Quantization for Efficient Fine-Tuning and Inference of Large Language Models
AAAI 2024
BitsFusion: 1.99 bits Weight Quantization of Diffusion Model
NIPS 2024
Building on Efficient Foundations: Effective Training of LLMs with Structured Feedforward Layers
NIPS 2024
RoCoIns: Enhancing Robustness of Large Language Models through Code-Style Instructions
COLING 2024