knowledge distillation
3680 papers
Also known as
CD
DMD
LORA
KL
DNA
SELF-DISTILLATION
TKD
NBOD
AD
KD
AOTD
KI
GID
FD
MKD
SEQKD
Co-occurring keywords
Papers
Teacher-Student Training for Debiasing: General Permutation Debiasing for Large Language Models
ACL 2024
Distilling Robustness into Natural Language Inference Models with Domain-Targeted Augmentation
ACL 2024
Enhancing Continual Learning in Visual Question Answering with Modality-Aware Feature Distillation
ACL 2024