Xianzhi Yu
7 papers · 2022–2026 · 4 conferences · across top CS/AI conferences
Achievements
Jump to papers ↓
π
Conference Polyglot
(3)
π
Interdisciplinary Bridge
πΊοΈ
Taxonomy Completionist
(11)
π§
Keyword Pioneer
π
Cross-Pollinator
(15)
Conferences
ACL (3)
EMNLP (2)
ICML (1)
NIPS (1)
Top co-authors
Keywords
model compression
(3)
post-training quantization
(2)
efficient computing
(2)
neural network optimization
(1)
convolutional neural network
(1)
mixture of expert
(1)
sparse activation
(1)
feed-forward network
(1)
inference efficiency
(1)
speculative decoding
(1)
latency optimization
(1)
test-time scaling
(1)
sparse convolution
(1)
large language model
(1)
neural network
(1)
neural processing unit
(1)
batch inference
(1)
branch-wise parallelism
(1)
quantization sensitivity
(1)
floating-point format
(1)
Papers
Benchmarking Post-Training Quantization of Large Language Models under Microscaling Floating Point Formats
ACL 2026
Analytical FFN-to-MoE Restructuring via Activation Pattern Analysis
ACL 2026
Unleashing Low-Bit Inference on Ascend NPUs: A Comprehensive Evaluation of HiFloat Formats
ACL 2026
FlatQuant: Flatness Matters for LLM Quantization
ICML 2025
Faster and Better LLMs via Latency-Aware Test-Time Scaling
EMNLP 2025
Accelerating Sparse Convolution with Column Vector-Wise Sparsity
NIPS 2022
HW-TSCβs Submission for the WMT22 Efficiency Task
EMNLP 2022