conftrace_

Zhuoming Chen

9 papers · 2022–2025 · 4 conferences · across top CS/AI conferences

Achievements

🐣 Hot Topic Early Bird 🌍 Conference Polyglot (4) 🐝 Cross-Pollinator (13) 🗺️ Taxonomy Completionist (18) 🌉 Interdisciplinary Bridge

🧭 Keyword Pioneer 🏆 Keyword Champion (2) ⚡ Prolific Year (5) ❓ The Questioner

Conferences

NIPS (5) ICLR (2) ICML (1) INTERSPEECH (1)

Top co-authors

Beidi Chen (7) Yang Zhou (3) Avner May (3) Zhihao Jia (3) Max Ryabinin (2) Ranajoy Sadhukhan (2) Ruslan Svirschevski (2) Yuandong Tian (2) James Cheung (1) Yitian Hong (1)

Keywords

token generation (2) speculative decoding (2) large language model (2) efficient computing (1) model serving (1) dynamic programming (1) gradient boosting (1) distributed training (1) model inference (1) sparse model (1) batch processing (1) language model (1) linear mixed-effects model (1) memory optimization (1) inference efficiency (1) contextual sparsity (1) latency reduction (1) token correction (1) inference acceleration (1) consumer gpu (1)

Papers

MagicDec: Breaking the Latency-Throughput Tradeoff for Long Context Generation with Speculative Decoding · ICLR 2025

MagicPIG: LSH Sampling for Efficient LLM Generation · ICLR 2025

GSM-∞: How Do your LLMs Behave over Infinitely Increasing Reasoning Complexity and Context Length? · ICML 2025

Mini-Sequence Transformers: Optimizing Intermediate Memory for Long Sequences Training · NIPS 2024

Sequoia: Scalable and Robust Speculative Decoding · NIPS 2024

Acoustic changes in speech prosody produced by children with autism after robot-assisted speech training · INTERSPEECH 2024

SpecExec: Massively Parallel Speculative Decoding For Interactive LLM Inference on Consumer Devices · NIPS 2024

SIRIUS: Contextual Sparsity with Correction for Efficient LLMs · NIPS 2024

Quantized Training of Gradient Boosting Decision Trees · NIPS 2022