Co-occurring keywords
Papers
Scalable Efficient Training of Large Language Models with Low-dimensional Projected Attention
EMNLP 2024
Pruning before Fine-tuning: A Retraining-free Compression Framework for Pre-trained Language Models
COLING 2024
Probe Then Retrieve and Reason: Distilling Probing and Reasoning Capabilities into Smaller Language Models
COLING 2024
Multilingual Brain Surgeon: Large Language Models Can Be Compressed Leaving No Language behind
COLING 2024