conftrace_

knowledge distillation

3725 papers

Explore in graph

Also known as

KD

Co-occurring keywords

model compression (3302) large language model (13587) transfer learning (5449) domain adaptation (4595) representation learning (6206) neural network (6616) language model (4599) catastrophic forgetting (958) continual learning (1181) contrastive learning (4032)

Papers

Towards Data-Free Model Stealing in a Hard Label Setting CVPR 2022

Calibrating Student Models for Emotion-related Tasks EMNLP 2022

CN-AutoMIC: Distilling Chinese Commonsense Knowledge from Pretrained Language Models EMNLP 2022

Distilled Dual-Encoder Model for Vision-Language Understanding EMNLP 2022

ConNER: Consistency Training for Cross-lingual Named Entity Recognition EMNLP 2022

Unifying the Convergences in Multilingual Neural Machine Translation EMNLP 2022

Tiny-NewsRec: Effective and Efficient PLM-based News Recommendation EMNLP 2022

Norm-based Noisy Corpora Filtering and Refurbishing in Neural Machine Translation EMNLP 2022

Wider & Closer: Mixture of Short-channel Distillers for Zero-shot Cross-lingual Named Entity Recognition EMNLP 2022

Evaluating Parameter Efficient Learning for Generation EMNLP 2022

Model Compression Using Optimal Transport WACV 2022

The Optimal BERT Surgeon: Scalable and Accurate Second-Order Pruning for Large Language Models EMNLP 2022

Sparse Teachers Can Be Dense with Knowledge EMNLP 2022

Not to Overfit or Underfit the Source Domains? An Empirical Study of Domain Generalization in Question Answering EMNLP 2022

Multi-level Distillation of Semantic Knowledge for Pre-training Multilingual Language Model EMNLP 2022

Distill The Image to Nowhere: Inversion Knowledge Distillation for Multimodal Machine Translation EMNLP 2022

Language Modelling via Learning to Rank AAAI 2022

Deeply Tensor Compressed Transformers for End-to-End Object Detection AAAI 2022

LGD: Label-Guided Self-Distillation for Object Detection AAAI 2022

Content-Variant Reference Image Quality Assessment via Knowledge Distillation AAAI 2022

Lifelong Person Re-identification by Pseudo Task Knowledge Preservation AAAI 2022

Cross-Layer Similarity Knowledge Distillation for Speech Enhancement INTERSPEECH 2022

Model Compression by Iterative Pruning with Knowledge Distillation and Its Application to Speech Enhancement INTERSPEECH 2022

Self-Distillation Based on High-level Information Supervision for Compressing End-to-End ASR Model INTERSPEECH 2022

ADD: Frequency Attention and Multi-View Based Knowledge Distillation to Detect Low-Quality Compressed Deepfake Images AAAI 2022