Co-occurring keywords
Papers
Scalable Efficient Training of Large Language Models with Low-dimensional Projected Attention
EMNLP 2024
Unlocking Data-free Low-bit Quantization with Matrix Decomposition for KV Cache Compression
ACL 2024
SpaFL: Communication-Efficient Federated Learning With Sparse Models And Low Computational Overhead
NIPS 2024