Papers
Sparse is Enough in Scaling Transformers
NIPS 2021
Channel Permutations for N:M Sparsity
NIPS 2021
Neural Routing by Memory
NIPS 2021