Co-occurring keywords
Papers
Extracting Interpretable Task-Specific Circuits from Large Language Models for Faster Inference
AAAI 2025
BigMac: A Communication-Efficient Mixture-of-Experts Model Structure for Fast Training and Inference
AAAI 2025
From PEFT to DEFT: Parameter Efficient Finetuning for Reducing Activation Density in Transformers
AAAI 2025
Fit and Prune: Fast and Training-free Visual Token Pruning for Multi-modal Large Language Models
AAAI 2025
Hybrid Data-Free Knowledge Distillation
AAAI 2025