Model Compression
1674 directly classified papers
Papers per year
Papers
Block Pruning For Faster Transformers
EMNLP 2021
Learning Compact Metrics for MT
EMNLP 2021
Mirror Descent View for Neural Network Quantization
AISTATS 2021