Co-occurring keywords
Papers
Tree-Like Decision Distillation
CVPR 2021
AutoTinyBERT: Automatic Hyper-parameter Optimization for Efficient Pre-trained Language Models
ACL 2021
Revisiting Pretraining with Adapters
ACL 2021
In-Batch Negatives for Knowledge Distillation with Tightly-Coupled Teachers for Dense Retrieval
ACL 2021
Refining Sample Embeddings with Relation Prototypes to Enhance Continual Relation Extraction
ACL 2021
How to Train BERT with an Academic Budget
EMNLP 2021