knowledge distillation
3680 papers
Also known as
CD
DMD
LORA
KL
DNA
SELF-DISTILLATION
TKD
NBOD
AD
KD
AOTD
KI
GID
FD
MKD
SEQKD
Co-occurring keywords
Papers
When Gradient Descent Meets Derivative-Free Optimization: A Match Made in Black-Box Scenario
ACL 2023
Federated Domain Adaptation for Named Entity Recognition via Distilling with Heterogeneous Tag Sets
ACL 2023
Operator Selection and Ordering in a Pipeline Approach to Efficiency Optimizations for Transformers
ACL 2023