Yushi Huang
6 papers · 2024–2026 · 5 top CS/AI conferences
Achievements
🌍 Conference Polyglot (3)
🌉 Interdisciplinary Bridge
🧭 Keyword Pioneer
🐝 Cross-Pollinator (15)
Conferences
AAAI (2)
ACL (1)
CVPR (1)
EMNLP (1)
ICML (1)
Keywords
model compression (3)
long-context inference (2)
model quantization (2)
efficient computing (1)
diffusion model (1)
hidden state (1)
vision-language model (1)
token pruning (1)
inference efficiency (1)
weight quantization (1)
sparse attention (1)
latency reduction (1)
inference acceleration (1)
kv cache (1)
temporal feature (1)
mixed precision (1)
token reduction (1)
large language model (1)
dynamic pruning (1)
compression toolkit (1)
Papers
LLMC+: Benchmarking Vision-Language Model Compression with a plug-and-play Toolkit
AAAI 2026
SlimInfer: Accelerating Long-Context LLM Inference via Dynamic Token Pruning
AAAI 2026
Focus-dLLM: Accelerating Long-Context Diffusion LLM Inference via Confidence-Guided Context Focusing
ACL 2026
HarmoniCa: Harmonizing Training and Inference for Better Feature Caching in Diffusion Transformer Acceleration
ICML 2025
TFMQ-DM: Temporal Feature Maintenance Quantization for Diffusion Models
CVPR 2024
LLMC: Benchmarking Large Language Model Quantization with a Versatile Compression Toolkit
EMNLP 2024