Papers
$\textit{Read-ME}$: Refactorizing LLMs as Router-Decoupled Mixture of Experts with System Co-Design
NeurIPS 2024
The Iterative Optimal Brain Surgeon: Faster Sparse Recovery by Leveraging Second-Order Information
NeurIPS 2024
Teaching Tiny Minds: Exploring Methods to Enhance Knowledge Distillation for Small Language Models
CoNLL 2024
Adaptive Data-Free Quantization
CVPR 2023
Solving Oscillation Problem in Post-Training Quantization Through a Theoretical Perspective
CVPR 2023
Diffusion Probabilistic Model Made Slim
CVPR 2023