Ying Sheng
12 papers · 2018–2025 · 5 conferences · across top CS/AI conferences
Achievements
Jump to papers ↓+6 more ↓ Show less ↑
π Conference Polyglot (5) π Renaissance Researcher (5) π Interdisciplinary Bridge π§ Keyword Pioneer π Academic Marathon (7)
πΊοΈ
Taxonomy Completionist
(19)
π
Conference Polyglot
(5)
π
Academic Marathon
(7)
π
Triple Crown
π
Century Club
(12)
β‘
Prolific Year
(5)
Conferences
NIPS (4)
ICML (3)
ICLR (2)
OSDI (2)
IJCAI (1)
Top co-authors
Keywords
large language model
(5)
inference optimization
(3)
kv cache
(2)
benchmark evaluation
(1)
efficient inference
(1)
model parallelism
(1)
prompt engineering
(1)
automated reasoning
(1)
low-rank approximation
(1)
matrix approximation
(1)
algorithm optimization
(1)
linear regression
(1)
structured output
(1)
low rank approximation
(1)
language model
(1)
batch processing
(1)
memory optimization
(1)
evaluation benchmark
(1)
formal methods
(1)
model selection
(1)
Papers
SORRY-Bench: Systematically Evaluating Large Language Model Safety Refusal
ICLR 2025
Fairness in Serving Large Language Models
OSDI 2024
SGLang: Efficient Execution of Structured Language Model Programs
NIPS 2024
Chatbot Arena: An Open Platform for Evaluating LLMs by Human Preference
ICML 2024
LMSYS-Chat-1M: A Large-Scale Real-World LLM Conversation Dataset
ICLR 2024
H2O: Heavy-Hitter Oracle for Efficient Generative Inference of Large Language Models
NIPS 2023
Judging LLM-as-a-Judge with MT-Bench and Chatbot Arena
NIPS 2023
Towards Optimal Caching and Model Selection for Large Model Inference
NIPS 2023
FlexGen: High-Throughput Generative Inference of Large Language Models with a Single GPU
ICML 2023
AlpaServe: Statistical Multiplexing with Model Parallelism for Deep Learning Serving
OSDI 2023
Politeness for the Theory of Algebraic Datatypes (Extended Abstract)
IJCAI 2021
Subspace Embedding and Linear Regression with Orlicz Norm
ICML 2018