Lianmin Zheng
14 papers · 2018–2024 · 4 conferences · across top CS/AI conferences
Achievements
Jump to papers ↓+10 more ↓ Show less ↑
🏃 Academic Marathon (6) 🌉 Interdisciplinary Bridge 🧭 Keyword Pioneer 🌍 Conference Polyglot (4) 🐝 Cross-Pollinator (12)
🌍
Conference Polyglot
(4)
🏃
Academic Marathon
(6)
🌈
Renaissance Researcher
(6)
👑
Triple Crown
🏆
Keyword Champion
(2)
📈
Trend Setter
💎
Century Club
(14)
⚡
Prolific Year
(5)
🗃️
Keyword Collector
(60)
🔥
Unstoppable
(5)
Conferences
NIPS (5)
ICML (4)
OSDI (4)
ICLR (1)
Top co-authors
Research topics
Keywords
large language model
(5)
inference optimization
(3)
kv cache
(2)
model compression
(2)
tensor program
(2)
neural network
(2)
deep learning
(2)
memory optimization
(2)
model parallelism
(2)
efficient inference
(1)
distributed training
(1)
structured output
(1)
deep learning model
(1)
convolutional neural network
(1)
program synthesis
(1)
benchmark evaluation
(1)
algorithm optimization
(1)
prompt engineering
(1)
evaluation benchmark
(1)
language model
(1)
Papers
Chatbot Arena: An Open Platform for Evaluating LLMs by Human Preference
ICML 2024
SGLang: Efficient Execution of Structured Language Model Programs
NIPS 2024
LMSYS-Chat-1M: A Large-Scale Real-World LLM Conversation Dataset
ICLR 2024
Towards Optimal Caching and Model Selection for Large Model Inference
NIPS 2023
AlpaServe: Statistical Multiplexing with Model Parallelism for Deep Learning Serving
OSDI 2023
FlexGen: High-Throughput Generative Inference of Large Language Models with a Single GPU
ICML 2023
H2O: Heavy-Hitter Oracle for Efficient Generative Inference of Large Language Models
NIPS 2023
Judging LLM-as-a-Judge with MT-Bench and Chatbot Arena
NIPS 2023
GACT: Activation Compressed Training for Generic Network Architectures
ICML 2022
Alpa: Automating Inter- and Intra-Operator Parallelism for Distributed Deep Learning
OSDI 2022
ActNN: Reducing Training Memory Footprint via 2-Bit Activation Compressed Training
ICML 2021
Ansor: Generating High-Performance Tensor Programs for Deep Learning
OSDI 2020
TVM: An Automated End-to-End Optimizing Compiler for Deep Learning
OSDI 2018
Learning to Optimize Tensor Programs
NIPS 2018