Alexey Tumanov
9 papers · 2016–2025 · 6 conferences · across top CS/AI conferences
Achievements
Jump to papers ↓+3 more ↓ Show less ↑
🐣 Hot Topic Early Bird 🧭 Keyword Pioneer 🌍 Conference Polyglot (6) 🏃 Academic Marathon (9) 🐝 Cross-Pollinator (9)
🌈
Renaissance Researcher
(5)
🌉
Interdisciplinary Bridge
📈
Trend Setter
Conferences
OSDI (3)
ECCV (2)
ICLR (1)
ICML (1)
NIPS (1)
NSDI (1)
Top co-authors
Keywords
reinforcement learning
(1)
uncertainty quantification
(1)
distributed computing
(1)
model serving
(1)
cascade classifier
(1)
resource allocation
(1)
weight sharing
(1)
disease detection
(1)
memory efficiency
(1)
cost-aware learning
(1)
request scheduling
(1)
inference serving
(1)
latency-accuracy tradeoff
(1)
task parallel computation
(1)
actor model
(1)
fault tolerant system
(1)
performance predictability
(1)
large language model inference
(1)
batch scheduling
(1)
throughput latency tradeoff
(1)
Papers
SuperServe: Fine-Grained Inference Serving for Unpredictable Workloads
NSDI 2025
RocketKV: Accelerating Long-Context LLM Inference via Two-Stage KV Cache Compression
ICML 2025
DεpS: Delayed ε-Shrinking for Faster Once-For-All Training
ECCV 2024
Taming Throughput-Latency Tradeoff in LLM Inference with Sarathi-Serve
OSDI 2024
SuperFedNAS: Cost-Efficient Federated Neural Architecture Search for On-Device Inference
ECCV 2024
UnfoldML: Cost-Aware and Uncertainty-Based Dynamic 2D Prediction for Multi-Stage Classification
NIPS 2022
CompOFA – Compound Once-For-All Networks for Faster Multi-Platform Deployment
ICLR 2021
Ray: A Distributed Framework for Emerging AI Applications
OSDI 2018
Morpheus: Towards Automated SLOs for Enterprise Clusters
OSDI 2016