Liu Guoming
6 papers · 2025 · 3 conferences · across top CS/AI conferences
Achievements
Taxonomy Completionist
(14)
Keyword Pioneer
Conference Polyglot
(3)
Cross-Pollinator
(14)
Interdisciplinary Bridge
Rising Star
(6)
Prolific Year
(6)
The Questioner
Conferences
ACL (3)
EMNLP (2)
ICML (1)
Top co-authors
Keywords
large language model
(4)
kv cache
(3)
inference efficiency
(2)
attention mechanism
(2)
model compression
(2)
memory efficiency
(1)
memory optimization
(1)
inference optimization
(1)
latency reduction
(1)
speculative decoding
(1)
kv cache quantization
(1)
attention weight
(1)
inference speed
(1)
prompt compression
(1)
long-context understanding
(1)
text compression
(1)
cache compression
(1)
codebook quantization
(1)
kv cache reduction
(1)
dynamic compression
(1)
Papers
KV-Latent: Dimensional-level KV Cache Reduction with Frequency-aware Rotary Positional Embedding
ACL 2025
DAC: A Dynamic Attention-aware Approach for Task-Agnostic Prompt Compression
ACL 2025
SpindleKV: A Novel KV Cache Reduction Method Balancing Both Shallow and Deep Layers
ACL 2025
XQuant: Achieving Ultra-Low Bit KV Cache Quantization with Cross-Layer Compression
EMNLP 2025
Faster In-Context Learning for LLMs via N-Gram Trie Speculative Decoding
EMNLP 2025
What Limits Bidirectional Model’s Generative Capabilities? A Uni-Bi-Directional Mixture-of-Expert Method For Bidirectional Fine-tuning
ICML 2025