knowledge distillation
3680 papers
Also known as
CD
DMD
LORA
KL
DNA
SELF-DISTILLATION
TKD
NBOD
AD
KD
AOTD
KI
GID
FD
MKD
SEQKD
Co-occurring keywords
Papers
ZeroQuant: Efficient and Affordable Post-Training Quantization for Large-Scale Transformers
NIPS 2022
Ensembling and Knowledge Distilling of Large Sequence Taggers for Grammatical Error Correction
ACL 2022
Sparse Progressive Distillation: Resolving Overfitting under Pretrain-and-Finetune Paradigm
ACL 2022