Fangmin Chen
4 papers · 2025–2026 · 4 conferences · across top CS/AI conferences
Achievements
Jump to papers ↓
🌍
Conference Polyglot
(3)
🌉
Interdisciplinary Bridge
🧭
Keyword Pioneer
🐝
Cross-Pollinator
(15)
Conferences
AAAI (1)
AACL (1)
ACL (1)
IJCNLP (1)
Top co-authors
Keywords
large language model
(4)
inference acceleration
(3)
model compression
(3)
structured sparsity
(1)
early stopping
(1)
sparse attention
(1)
gpu acceleration
(1)
mixed precision
(1)
long-context inference
(1)
weight compression
(1)
group quantization
(1)
arbitrary precision
(1)
online permutation
(1)
model quantization
(1)
post-training quantization
(1)
Papers
S2O: Early Stopping for Sparse Attention via Online Permutation
ACL 2026
ABQ-LLM: Arbitrary-Bit Quantized Inference Acceleration for Large Language Models
AAAI 2025
GQSA: Group Quantization and Sparsity for Accelerating Large Language Model Inference
AACL 2025
GQSA: Group Quantization and Sparsity for Accelerating Large Language Model Inference
IJCNLP 2025