Papers
From PEFT to DEFT: Parameter Efficient Finetuning for Reducing Activation Density in Transformers
AAAI 2025
Accurate Sublayer Pruning for Large Language Models by Exploiting Latency and Tunability Information
IJCAI 2025
QPruner: Probabilistic Decision Quantization for Structured Pruning in Large Language Models
NAACL 2025