Huanran Zheng
4 papers · 2022–2025 · 2 conferences · across top CS/AI conferences
Achievements
Jump to papers ↓+1 more ↓ Show less ↑
π Conference Polyglot (2) π Interdisciplinary Bridge πΊοΈ Taxonomy Completionist (14) π§ Keyword Pioneer π£ Hot Topic Early Bird
π
Cross-Pollinator
(15)
Conferences
EMNLP (3)
ACL (1)
Top co-authors
Keywords
large language model
(2)
model compression
(2)
machine translation
(1)
computational efficiency
(1)
parameter efficient
(1)
model uncertainty
(1)
low-rank adaptation
(1)
mixture of expert
(1)
context window
(1)
kv cache compression
(1)
latency reduction
(1)
kv cache
(1)
speculative decoding
(1)
draft model
(1)
inference latency
(1)
model optimization
(1)
inference speed
(1)
translation quality
(1)
non-autoregressive translation
(1)
attention optimization
(1)
Papers
Faster Speculative Decoding via Effective Draft Decoder with Pruned Candidate Tree
ACL 2025
SCA: Selective Compression Attention for Efficiently Extending the Context Window of Large Language Models
EMNLP 2024
MiLoRA: Efficient Mixture of Low-Rank Adaptation for Large Language Models Fine-tuning
EMNLP 2024
Candidate Soups: Fusing Candidate Results Improves Translation Quality for Non-Autoregressive Translation
EMNLP 2022