Co-occurring keywords
Papers
Are Intermediate Layers and Labels Really Necessary? A General Language Model Distillation Method
ACL 2023
A Study on Knowledge Distillation from Weak Teacher for Scaling Up Pre-trained Language Models
ACL 2023
Continual Generalized Intent Discovery: Marching Towards Dynamic and Open-world Intent Recognition
EMNLP 2023
Knowledge Distillation on Joint Task End-to-End Speech Translation
INTERSPEECH 2023
Connective Prediction for Implicit Discourse Relation Recognition via Knowledge Distillation
ACL 2023
FutureTOD: Teaching Future Knowledge to Pre-trained Language Model for Task-Oriented Dialogue
ACL 2023
Local or Global: Selective Knowledge Assimilation for Federated Learning with Limited Labels
ICCV 2023
A Teacher-Student Approach for Extracting Informative Speaker Embeddings From Speech Mixtures
INTERSPEECH 2023