Shijie Cao
7 papers · 2019–2025 · 5 conferences · across top CS/AI conferences
Achievements
Jump to papers ↓+5 more ↓ Show less ↑
π£ Hot Topic Early Bird π Renaissance Researcher (5) π Interdisciplinary Bridge π§ Keyword Pioneer π Conference Polyglot (5)
π
Academic Marathon
(6)
π
Cross-Pollinator
(13)
πΊοΈ
Taxonomy Completionist
(19)
π
Conference Pioneer
π
Trend Setter
Conferences
ACL (3)
AAAI (1)
CVPR (1)
INTERSPEECH (1)
OSDI (1)
Top co-authors
Keywords
model compression
(6)
large language model
(3)
efficient inference
(2)
weight quantization
(2)
knowledge distillation
(2)
inference optimization
(2)
inference acceleration
(2)
gpu computing
(1)
deep learning
(1)
neural network pruning
(1)
edge computing
(1)
deep neural network
(1)
inference efficiency
(1)
edge inference
(1)
convolutional neural network
(1)
gpu acceleration
(1)
low-bit quantization
(1)
hardware acceleration
(1)
sparsity pattern
(1)
sparse model
(1)
Papers
Bitnet.cpp: Efficient Edge Inference for Ternary LLMs
ACL 2025
BitDistiller: Unleashing the Potential of Sub-4-Bit LLMs via Self-Distillation
ACL 2024
AFPQ: Asymmetric Floating Point Quantization for LLMs
ACL 2024
Ladder: Enabling Efficient Low-Precision Deep Learning Computing through Hardware-aware Tensor Transformation
OSDI 2024
Accurate and Structured Pruning for Efficient Automatic Speech Recognition
INTERSPEECH 2023
Balanced Sparsity for Efficient DNN Inference on GPU
AAAI 2019
SeerNet: Predicting Convolutional Neural Network Feature-Map Sparsity Through Low-Bit Quantization
CVPR 2019