Papers
Low-bit Shift Network for End-to-End Spoken Language Understanding
INTERSPEECH 2022
Optimal Brain Compression: A Framework for Accurate Post-Training Quantization and Pruning
NeurIPS 2022
Who Says Elephants Can’t Run: Bringing Large Scale MoE Models into Cloud Scale Production
EMNLP 2022
Low-complex and Highly-performed Binary Residual Neural Network for Small-footprint Keyword Spotting
INTERSPEECH 2022
Accelerating Inference and Language Model Fusion of Recurrent Neural Network Transducers via End-to-End 4-bit Quantization
INTERSPEECH 2022
Sub-8-Bit Quantization Aware Training for 8-Bit Neural Network Accelerator with On-Device Speech Recognition
INTERSPEECH 2022