knowledge distillation
3680 papers
Also known as
CD
DMD
LORA
KL
DNA
SELF-DISTILLATION
TKD
NBOD
AD
KD
AOTD
KI
GID
FD
MKD
SEQKD
Papers
A Lightweight Mixture-of-Experts Neural Machine Translation Model with Stage-wise Training Strategy
NAACL 2024
Efficient Citer: Tuning Large Language Models for Enhanced Answer Quality and Verification
NAACL 2024
Leros: Learning Explicit Reasoning on Synthesized Data for Commonsense Question Answering
COLING 2024
Multilingual Brain Surgeon: Large Language Models Can Be Compressed Leaving No Language Behind
COLING 2024
Progressive Distillation Based on Masked Generation Feature Method for Knowledge Graph Completion
AAAI 2024
Ungeneralizable Examples
CVPR 2024