Wencong Xiao
7 papers · 2017–2024 · 4 conferences · across top CS/AI conferences
Achievements
Jump to papers ↓+6 more ↓ Show less ↑
🌉 Interdisciplinary Bridge 🏃 Academic Marathon (7) 🧭 Keyword Pioneer 🌍 Conference Polyglot (4) 🐣 Hot Topic Early Bird
🧭
Keyword Pioneer
🌍
Conference Polyglot
(4)
🏃
Academic Marathon
(7)
🧬
Topic Evolution
📈
Trend Setter
🚀
Conference Pioneer
Conferences
OSDI (3)
NSDI (2)
AAAI (1)
CVPR (1)
Top co-authors
Keywords
cluster scheduling
(3)
gpu scheduling
(2)
model compression
(2)
gpu utilization
(2)
resource utilization
(2)
gpu cluster
(2)
deep learning
(2)
model serving
(1)
deep learning training
(1)
stochastic gradient descent
(1)
sparse model
(1)
sparsity pattern
(1)
convolutional neural network
(1)
inference optimization
(1)
gpu computing
(1)
gpu acceleration
(1)
machine learning
(1)
low-bit quantization
(1)
heterogeneous computing
(1)
parallel computing
(1)
Papers
Llumnix: Dynamic Scheduling for Large Language Model Serving
OSDI 2024
MLaaS in the Wild: Workload Analysis and Scheduling in Large-Scale Heterogeneous GPU Clusters
NSDI 2022
AntMan: Dynamic Scaling on GPU Clusters for Deep Learning
OSDI 2020
Balanced Sparsity for Efficient DNN Inference on GPU
AAAI 2019
SeerNet: Predicting Convolutional Neural Network Feature-Map Sparsity Through Low-Bit Quantization
CVPR 2019
Gandiva: Introspective Cluster Scheduling for Deep Learning
OSDI 2018
Tux²: Distributed Graph Computation for Machine Learning
NSDI 2017