Co-occurring keywords
Papers
The Optimal BERT Surgeon: Scalable and Accurate Second-Order Pruning for Large Language Models
EMNLP 2022
Sparse Teachers Can Be Dense with Knowledge
EMNLP 2022
Multi-level Distillation of Semantic Knowledge for Pre-training Multilingual Language Model
EMNLP 2022
Distill The Image to Nowhere: Inversion Knowledge Distillation for Multimodal Machine Translation
EMNLP 2022
Language Modelling via Learning to Rank
AAAI 2022
Cross-Layer Similarity Knowledge Distillation for Speech Enhancement
INTERSPEECH 2022
Model Compression by Iterative Pruning with Knowledge Distillation and Its Application to Speech Enhancement
INTERSPEECH 2022