conftrace_

Ningxin Zheng

6 papers · 2022–2025 · 4 top CS/AI conferences

Achievements

🌉 Interdisciplinary Bridge · 🧭 Keyword Pioneer · 🌍 Conference Polyglot (4) · 🐝 Cross-Pollinator (12) · 🗺️ Taxonomy Completionist (10) · 🐣 Hot Topic Early Bird

Conferences

OSDI (3) · CVPR (1) · ICCV (1) · ICML (1)

Top co-authors

Yuqing Yang (5) · Fan Yang (3) · Lingxiao Ma (3) · Quanlu Zhang (3) · Mao Yang (3) · Lidong Zhou (2) · Ting Cao (2) · Jilong Xue (2) · Lingji Ouyang (1) · Haisheng Tan (1)

Keywords

hardware acceleration (2) · model compression (2) · attention mechanism (1) · neural architecture search (1) · efficient inference (1) · efficient computing (1) · deep neural network (1) · memory efficiency (1) · inference optimization (1) · inference latency (1) · model acceleration (1) · cascaded attention (1) · model sparsity (1) · compiler optimization (1) · latency optimization (1) · low-precision computing (1) · tensor transformation (1) · data type optimization (1) · tensor abstraction (1) · operator optimization (1)

Papers

ShadowKV: KV Cache in Shadows for High-Throughput Long-Context LLM Inference · ICML 2025
Ladder: Enabling Efficient Low-Precision Deep Learning Computing through Hardware-aware Tensor Transformation · OSDI 2024
EfficientViT: Memory Efficient Vision Transformer With Cascaded Group Attention · CVPR 2023
SpaceEvo: Hardware-Friendly Search Space Design for Efficient INT8 Inference · ICCV 2023
Optimizing Dynamic Neural Networks with Brainstorm · OSDI 2023
SparTA: Deep-Learning Model Sparsity via Tensor-with-Sparsity-Attribute · OSDI 2022