knowledge distillation
3680 papers
Also known as: CD, DMD, LORA, KL, DNA, SELF-DISTILLATION, TKD, NBOD, AD, KD, AOTD, KI, GID, FD, MKD, SEQKD
Papers
SciInstruct: a Self-Reflective Instruction Annotated Dataset for Training Scientific Language Models
NIPS 2024
JiuZhang3.0: Efficiently Improving Mathematical Reasoning by Training Small Data Synthesis Models
NIPS 2024
Learning from Teaching Regularization: Generalizable Correlations Should be Easy to Imitate
NIPS 2024