conftrace_

knowledge distillation

3725 papers

Also known as

KD

Co-occurring keywords

model compression (3302)
large language model (13587)
transfer learning (5449)
domain adaptation (4595)
representation learning (6206)
neural network (6616)
language model (4599)
catastrophic forgetting (958)
continual learning (1181)
contrastive learning (4032)

Papers

Efficient Conditional GAN Transfer With Knowledge Propagation Across Classes CVPR 2021

Importance-based Neuron Allocation for Multilingual Neural Machine Translation ACL 2021

Tree-Like Decision Distillation CVPR 2021

AutoTinyBERT: Automatic Hyper-parameter Optimization for Efficient Pre-trained Language Models ACL 2021

Thinking Fast and Slow: Efficient Text-to-Visual Retrieval With Transformers CVPR 2021

Revisiting Pretraining with Adapters ACL 2021

In-Batch Negatives for Knowledge Distillation with Tightly-Coupled Teachers for Dense Retrieval ACL 2021

Incremental Embedding Learning via Zero-Shot Translation AAAI 2021

Learning to Explain: Generating Stable Explanations Fast ACL 2021

Refining Sample Embeddings with Relation Prototypes to Enhance Continual Relation Extraction ACL 2021

PRAL: A Tailored Pre-Training Model for Task-Oriented Dialog Generation ACL 2021

EvDistill: Asynchronous Events To End-Task Learning via Bidirectional Reconstruction-Guided Cross-Modal Knowledge Distillation CVPR 2021

Data-Free Knowledge Distillation for Image Super-Resolution CVPR 2021

Anomaly Detection in Video via Self-Supervised and Multi-Task Learning CVPR 2021

Joint-DetNAS: Upgrade Your Detector With NAS, Pruning and Dynamic Distillation CVPR 2021

Unbiased Mean Teacher for Cross-Domain Object Detection CVPR 2021

Multilingual Machine Translation Systems from Microsoft for WMT21 Shared Task EMNLP 2021

Small Model and In-Domain Data Are All You Need EMNLP 2021

Improving Span Representation for Domain-adapted Coreference Resolution EMNLP 2021

How to Train BERT with an Academic Budget EMNLP 2021

Will this Question be Answered? Question Filtering via Answer Model Distillation for Efficient Question Answering EMNLP 2021

Allocating Large Vocabulary Capacity for Cross-Lingual Language Model Pre-Training EMNLP 2021

RocketQAv2: A Joint Training Method for Dense Passage Retrieval and Passage Re-ranking EMNLP 2021

Exploring Non-Autoregressive Text Style Transfer EMNLP 2021

Samsung R&D Institute Poland submission to WAT 2021 Indic Language Multilingual Task ACL 2021