conftrace_

knowledge distillation

3725 papers

Explore in graph

Also known as

KD

Co-occurring keywords

model compression (3302) large language model (13587) transfer learning (5449) domain adaptation (4595) representation learning (6206) neural network (6616) language model (4599) catastrophic forgetting (958) continual learning (1181) contrastive learning (4032)

Papers

CRKD: Enhanced Camera-Radar Object Detection with Cross-modality Knowledge Distillation CVPR 2024

WebUOT-1M: Advancing Deep Underwater Object Tracking with A Million-Scale Benchmark NIPS 2024

EM Distillation for One-step Diffusion Models NIPS 2024

DHA: Learning Decoupled-Head Attention from Transformer Checkpoints via Adaptive Heads Fusion NIPS 2024

Twin-Merging: Dynamic Integration of Modular Expertise in Model Merging NIPS 2024

Fine-Grained Prototypes Distillation for Few-Shot Object Detection AAAI 2024

Self-chats from Large Language Models Make Small Emotional Support Chatbot Better ACL 2024

Small But Funny: A Feedback-Driven Approach to Humor Distillation ACL 2024

Multistage Collaborative Knowledge Distillation from a Large Language Model for Semi-Supervised Sequence Generation ACL 2024

D2LLM: Decomposed and Distilled Large Language Models for Semantic Search ACL 2024

Learning Better Representations From Less Data For Propositional Satisfiability NIPS 2024

Distilling Robustness into Natural Language Inference Models with Domain-Targeted Augmentation ACL 2024

SlimSAM: 0.1% Data Makes Segment Anything Slim NIPS 2024

Compact Language Models via Pruning and Knowledge Distillation NIPS 2024

DenseFusion-1M: Merging Vision Experts for Comprehensive Multimodal Perception NIPS 2024

JiuZhang3.0: Efficiently Improving Mathematical Reasoning by Training Small Data Synthesis Models NIPS 2024

SciInstruct: a Self-Reflective Instruction Annotated Dataset for Training Scientific Language Models NIPS 2024

Teacher-Student Training for Debiasing: General Permutation Debiasing for Large Language Models ACL 2024

Flexible Weight Tuning and Weight Fusion Strategies for Continual Named Entity Recognition ACL 2024

Unsupervised Distractor Generation via Large Language Model Distilling and Counterfactual Contrastive Decoding ACL 2024

Incremental Sequence Labeling: A Tale of Two Shifts ACL 2024

LLM-QAT: Data-Free Quantization Aware Training for Large Language Models ACL 2024

Distilling Knowledge from Text-to-Image Generative Models Improves Visio-Linguistic Reasoning in CLIP EMNLP 2024

Teaching-Assistant-in-the-Loop: Improving Knowledge Distillation from Imperfect Teacher Models in Low-Budget Scenarios ACL 2024

RDRec: Rationale Distillation for LLM-based Recommendation ACL 2024