Quanlu Zhang
11 papers · 2018–2024 · 5 conferences · across top CS/AI conferences
Achievements
Jump to papers ↓+6 more ↓ Show less ↑
🏃 Academic Marathon (6) 🧭 Keyword Pioneer 🌉 Interdisciplinary Bridge 🌍 Conference Polyglot (5) 🐣 Hot Topic Early Bird
🌍
Conference Polyglot
(5)
🏃
Academic Marathon
(6)
🐣
Hot Topic Early Bird
📈
Trend Setter
🗃️
Keyword Collector
(50)
💎
Century Club
(11)
Conferences
OSDI (6)
ICCV (2)
COLING (1)
CVPR (1)
NIPS (1)
Top co-authors
Keywords
model compression
(3)
neural architecture search
(3)
deep learning training
(2)
latency optimization
(2)
gpu scheduling
(2)
hardware acceleration
(2)
knowledge distillation
(1)
face detection
(1)
neural network training
(1)
resource allocation
(1)
neural network model
(1)
model parallelization
(1)
deep neural network
(1)
context length
(1)
mobile deployment
(1)
matrix factorization
(1)
inference latency
(1)
inference efficiency
(1)
weight pruning
(1)
deep learning
(1)
Papers
You Only Cache Once: Decoder-Decoder Architectures for Language Models
NIPS 2024
Ladder: Enabling Efficient Low-Precision Deep Learning Computing through Hardware-aware Tensor Transformation
OSDI 2024
nnScaler: Constraint-Guided Parallelization Plan Generation for Deep Learning Training
OSDI 2024
SpaceEvo: Hardware-Friendly Search Space Design for Efficient INT8 Inference
ICCV 2023
ElasticViT: Conflict-aware Supernet Training for Deploying Fast Vision Transformer on Diverse Mobile Devices
ICCV 2023
Privacy-Preserving Online AutoML for Domain-Specific Face Detection
CVPR 2022
SparTA: Deep-Learning Model Sparsity via Tensor-with-Sparsity-Attribute
OSDI 2022
Retiarii: A Deep Learning Exploratory-Training Framework
OSDI 2020
LadaBERT: Lightweight Adaptation of BERT through Hybrid Model Compression
COLING 2020
HiveD: Sharing a GPU Cluster for Deep Learning with Guarantees
OSDI 2020
Gandiva: Introspective Cluster Scheduling for Deep Learning
OSDI 2018