conftrace_

Huanran Zheng

4 papers · 2022–2025 · 2 conferences · across top CS/AI conferences

Achievements

🌍 Conference Polyglot (2) 🌉 Interdisciplinary Bridge 🗺️ Taxonomy Completionist (14) 🧭 Keyword Pioneer 🐣 Hot Topic Early Bird 🐝 Cross-Pollinator (15)

Conferences

EMNLP (3) ACL (1)

Top co-authors

Wei Zhu (3) Xiaoling Wang (3) Pengfei Wang (1) Yi Zhao (1) Jingfan Zhang (1) Dan Chen (1) Xing Tian (1)

Keywords

large language model (2) model compression (2) machine translation (1) computational efficiency (1) parameter efficient (1) model uncertainty (1) low-rank adaptation (1) mixture of expert (1) context window (1) kv cache compression (1) latency reduction (1) kv cache (1) speculative decoding (1) draft model (1) inference latency (1) model optimization (1) inference speed (1) translation quality (1) non-autoregressive translation (1) attention optimization (1)

Papers

Faster Speculative Decoding via Effective Draft Decoder with Pruned Candidate Tree · ACL 2025

SCA: Selective Compression Attention for Efficiently Extending the Context Window of Large Language Models · EMNLP 2024

MiLoRA: Efficient Mixture of Low-Rank Adaptation for Large Language Models Fine-tuning · EMNLP 2024

Candidate Soups: Fusing Candidate Results Improves Translation Quality for Non-Autoregressive Translation · EMNLP 2022