Ningxin Zheng
6 papers · 2022–2025 · 4 conferences · across top CS/AI conferences
Achievements
Jump to papers ↓+1 more ↓ Show less ↑
π Interdisciplinary Bridge π§ Keyword Pioneer π Conference Polyglot (4) π Cross-Pollinator (12) πΊοΈ Taxonomy Completionist (10)
π£
Hot Topic Early Bird
Conferences
OSDI (3)
CVPR (1)
ICCV (1)
ICML (1)
Top co-authors
Keywords
hardware acceleration
(2)
model compression
(2)
attention mechanism
(1)
neural architecture search
(1)
efficient inference
(1)
efficient computing
(1)
deep neural network
(1)
memory efficiency
(1)
inference optimization
(1)
inference latency
(1)
model acceleration
(1)
cascaded attention
(1)
model sparsity
(1)
compiler optimization
(1)
latency optimization
(1)
low-precision computing
(1)
tensor transformation
(1)
data type optimization
(1)
tensor abstraction
(1)
operator optimization
(1)
Papers
ShadowKV: KV Cache in Shadows for High-Throughput Long-Context LLM Inference
ICML 2025
Ladder: Enabling Efficient Low-Precision Deep Learning Computing through Hardware-aware Tensor Transformation
OSDI 2024
EfficientViT: Memory Efficient Vision Transformer With Cascaded Group Attention
CVPR 2023
SpaceEvo: Hardware-Friendly Search Space Design for Efficient INT8 Inference
ICCV 2023
Optimizing Dynamic Neural Networks with Brainstorm
OSDI 2023
SparTA: Deep-Learning Model Sparsity via Tensor-with-Sparsity-Attribute
OSDI 2022