Zhuoming Chen
9 papers · 2022–2025 · 4 conferences · across top CS/AI conferences
Achievements
Jump to papers ↓+4 more ↓ Show less ↑
π£ Hot Topic Early Bird π Conference Polyglot (4) π Cross-Pollinator (13) πΊοΈ Taxonomy Completionist (18) π Interdisciplinary Bridge
π§
Keyword Pioneer
π
Keyword Champion
(2)
β‘
Prolific Year
(5)
β
The Questioner
Conferences
NIPS (5)
ICLR (2)
ICML (1)
INTERSPEECH (1)
Top co-authors
Keywords
token generation
(2)
speculative decoding
(2)
large language model
(2)
efficient computing
(1)
model serving
(1)
dynamic programming
(1)
gradient boosting
(1)
distributed training
(1)
model inference
(1)
sparse model
(1)
batch processing
(1)
language model
(1)
linear mixed-effects model
(1)
memory optimization
(1)
inference efficiency
(1)
contextual sparsity
(1)
latency reduction
(1)
token correction
(1)
inference acceleration
(1)
consumer gpu
(1)
Papers
MagicDec: Breaking the Latency-Throughput Tradeoff for Long Context Generation with Speculative Decoding
ICLR 2025
MagicPIG: LSH Sampling for Efficient LLM Generation
ICLR 2025
GSM-$β$: How Do your LLMs Behave over Infinitely Increasing Reasoning Complexity and Context Length?
ICML 2025
Mini-Sequence Transformers: Optimizing Intermediate Memory for Long Sequences Training
NIPS 2024
Sequoia: Scalable and Robust Speculative Decoding
NIPS 2024
Acoustic changes in speech prosody produced by children with autism after robot-assisted speech training
INTERSPEECH 2024
SpecExec: Massively Parallel Speculative Decoding For Interactive LLM Inference on Consumer Devices
NIPS 2024
SIRIUS : Contexual Sparisty with Correction for Efficient LLMs
NIPS 2024
Quantized Training of Gradient Boosting Decision Trees
NIPS 2022