Co-occurring keywords
Papers
EasyDistill: A Comprehensive Toolkit for Effective Knowledge Distillation of Large Language Models
EMNLP 2025
Scaling Down, Serving Fast: Compressing and Deploying Efficient LLMs for Recommendation Systems
EMNLP 2025
AMQ: Enabling AutoML for Mixed-precision Weight-Only Quantization of Large Language Models
EMNLP 2025