Model Compression
1674 directly classified papers
Papers per year
Papers
Quadapter: Adapter for GPT-2 Quantization
EMNLP 2022
Low-bit Shift Network for End-to-End Spoken Language Understanding
INTERSPEECH 2022
Channel Permutations for N:M Sparsity
NIPS 2021
On-Device Streaming Transformer-Based End-to-End Speech Recognition
INTERSPEECH 2021