Papers
Hash Layers For Large Sparse Models
NeurIPS 2021
Locality Sensitive Teaching
NeurIPS 2021
Adder Attention for Vision Transformer
NeurIPS 2021