Yizhou Shan
6 papers · 2018–2025 · 4 conferences · across top CS/AI conferences
Achievements
Jump to papers ↓+3 more ↓ Show less ↑
πΊοΈ Taxonomy Completionist (12) π§ Keyword Pioneer π Conference Polyglot (4) π Academic Marathon (7) π Cross-Pollinator (3)
π
Renaissance Researcher
(5)
π
Interdisciplinary Bridge
π
Trend Setter
Conferences
OSDI (3)
ACL (1)
ICML (1)
NSDI (1)
Top co-authors
Keywords
efficient computing
(1)
model serving
(1)
token pruning
(1)
key-value cache
(1)
tail latency
(1)
virtual machine
(1)
bare-metal cloud
(1)
distributed system
(1)
resource utilization
(1)
disaggregated memory
(1)
asynchronous programming
(1)
thread scheduling
(1)
remote memory access
(1)
rust compiler
(1)
operating system
(1)
confidential computing
(1)
hardware security
(1)
gpu scheduling
(1)
attention sparsity
(1)
large language model
(1)
Papers
RaaS: Reasoning-Aware Attention Sparsity for Efficient LLM Reasoning
ACL 2025
EPIC: Efficient Position-Independent Caching for Serving Large Language Models
ICML 2025
Beehive: A Scalable Disaggregated Memory Runtime Exploiting Asynchrony of Multithreaded Programs
NSDI 2025
BlitzScale: Fast and Live Large Model Autoscaling with O(1) Host Caching
OSDI 2025
Core slicing: closing the gap between leaky confidential VMs and bare-metal cloud
OSDI 2023
LegoOS: A Disseminated, Distributed OS for Hardware Resource Disaggregation
OSDI 2018