Co-occurring keywords
Papers
A Multiple-Teacher Pruning Based Self-Distillation (MT-PSD) Approach to Model Compression for Audio-Visual Wake Word Spotting
INTERSPEECH 2023
Are Intermediate Layers and Labels Really Necessary? A General Language Model Distillation Method
ACL 2023