conftrace_

Liu Guoming

6 papers · 2025 · 3 conferences · across top CS/AI conferences

Achievements

🗺️ Taxonomy Completionist (14) 🧭 Keyword Pioneer 🌍 Conference Polyglot (3) 🐝 Cross-Pollinator (14) 🌉 Interdisciplinary Bridge

⭐ Rising Star (6) ⚡ Prolific Year (6) ❓ The Questioner

Conferences

ACL (3) EMNLP (2) ICML (1)

Top co-authors

Zuchao Li (6) Baoyuan Qi (6) Hai Zhao (5) Lefei Zhang (3) Ping Wang (3) Shi Luohe (2) Qiwei Li (2) Haoqi Yang (1) Haojun Ai (1) Yao Yao (1)

Keywords

large language model (4) kv cache (3) inference efficiency (2) attention mechanism (2) model compression (2) memory efficiency (1) memory optimization (1) inference optimization (1) latency reduction (1) speculative decoding (1) kv cache quantization (1) attention weight (1) inference speed (1) prompt compression (1) long-context understanding (1) text compression (1) cache compression (1) codebook quantization (1) kv cache reduction (1) dynamic compression (1)

Papers

KV-Latent: Dimensional-level KV Cache Reduction with Frequency-aware Rotary Positional Embedding · ACL 2025
DAC: A Dynamic Attention-aware Approach for Task-Agnostic Prompt Compression · ACL 2025
SpindleKV: A Novel KV Cache Reduction Method Balancing Both Shallow and Deep Layers · ACL 2025
XQuant: Achieving Ultra-Low Bit KV Cache Quantization with Cross-Layer Compression · EMNLP 2025
Faster In-Context Learning for LLMs via N-Gram Trie Speculative Decoding · EMNLP 2025
What Limits Bidirectional Model’s Generative Capabilities? A Uni-Bi-Directional Mixture-of-Expert Method For Bidirectional Fine-tuning · ICML 2025