conftrace_

Fangmin Chen

4 papers · 2025–2026 · 4 conferences · across top CS/AI conferences

Achievements


🌍 Conference Polyglot (3) · 🌉 Interdisciplinary Bridge · 🧭 Keyword Pioneer · 🐝 Cross-Pollinator (15)

Conferences

AAAI (1) · AACL (1) · ACL (1) · IJCNLP (1)

Top co-authors

Songwei Liu (4) · Shu Yang (3) · Chao Zeng (3) · Xing Mei (3) · Lean Fu (2) · Beichen Ning (1) · Yusheng Xie (1) · Xiaojian Wang (1) · Chenqian Yan (1) · Miao Wei (1)

Keywords

large language model (4) · inference acceleration (3) · model compression (3) · structured sparsity (1) · early stopping (1) · sparse attention (1) · gpu acceleration (1) · mixed precision (1) · long-context inference (1) · weight compression (1) · group quantization (1) · arbitrary precision (1) · online permutation (1) · model quantization (1) · post-training quantization (1)

Papers

S2O: Early Stopping for Sparse Attention via Online Permutation · ACL 2026
ABQ-LLM: Arbitrary-Bit Quantized Inference Acceleration for Large Language Models · AAAI 2025
GQSA: Group Quantization and Sparsity for Accelerating Large Language Model Inference · AACL 2025
GQSA: Group Quantization and Sparsity for Accelerating Large Language Model Inference · IJCNLP 2025