knowledge distillation
3680 papers
Also known as
CD
DMD
LORA
KL
DNA
SELF-DISTILLATION
TKD
NBOD
AD
KD
AOTD
KI
GID
FD
MKD
SEQKD
Co-occurring keywords
Papers
Train Big, Then Compress: Rethinking Model Size for Efficient Training and Inference of Transformers
ICML 2020
A Joint Learning Approach based on Self-Distillation for Keyphrase Extraction from Scientific Documents
COLING 2020