Research Explorer
Papers
Trends
Conferences
Explore
Authors
Topics
Keywords
Papers
Trends
Conferences
Explore
Authors
Topics
Keywords
Achievements
About
Methodology
← Keywords
post-training quantization
124 papers
Explore in graph
Also known as
PTQ
Co-occurring keywords
model compression
(3283)
model quantization
(279)
weight quantization
(133)
large language model
(12755)
activation quantization
(47)
neural network quantization
(122)
diffusion model
(3720)
neural network optimization
(1293)
efficient computing
(779)
vision transformer
(1091)
Papers
MPPQ: Enhancing Post-Training Quantization for LLMs via Mixed Supervision, Proxy Rounding, and Pre-Searching
IJCAI 2025
AMP-ViT: Optimizing Vision Transformer Efficiency with Adaptive Mixed-Precision Post-Training Quantization
WACV 2025
KurTail : Kurtosis-based LLM Quantization
EMNLP 2025
L4Q: Parameter Efficient Quantization-Aware Fine-Tuning on Large Language Models
ACL 2025
Q-VLM: Post-training Quantization for Large Vision-Language Models
NIPS 2024
MagR: Weight Magnitude Reduction for Enhancing Post-Training Quantization
NIPS 2024
Towards Next-Level Post-Training Quantization of Hyper-Scale Transformers
NIPS 2024
2DQuant: Low-bit Post-Training Quantization for Image Super-Resolution
NIPS 2024
QTIP: Quantization with Trellises and Incoherence Processing
NIPS 2024
PTQ4DiT: Post-training Quantization for Diffusion Transformers
NIPS 2024
LRQuant: Learnable and Robust Post-Training Quantization for Large Language Models
ACL 2024
Outlier Reduction with Gated Attention for Improved Post-training Quantization in Large Sequence-to-sequence Speech Foundation Models
INTERSPEECH 2024
Make RepVGG Greater Again: A Quantization-Aware Approach
AAAI 2024
Selective Focus: Investigating Semantics Sensitivity in Post-training Quantization for Lane Detection
AAAI 2024
PTMQ: Post-training Multi-Bit Quantization of Neural Networks
AAAI 2024
Towards Accurate Post-training Quantization for Diffusion Models
CVPR 2024
Instance-Aware Group Quantization for Vision Transformers
CVPR 2024
VPTQ: Extreme Low-bit Vector Post-Training Quantization for Large Language Models
EMNLP 2024
TFMQ-DM: Temporal Feature Maintenance Quantization for Diffusion Models
CVPR 2024
Enhancing Post-training Quantization Calibration through Contrastive Learning
CVPR 2024
On the Way to Lossless Compression of Language Transformers: Exploring Cross-Domain Properties of Quantization
COLING 2024
When Quantization Affects Confidence of Large Language Models?
NAACL 2024
HyQ: Hardware-Friendly Post-Training Quantization for CNN-Transformer Hybrid Networks
IJCAI 2024
Improving Conversational Abilities of Quantized Large Language Models via Direct Preference Alignment
ACL 2024
Optimize Weight Rounding via Signed Gradient Descent for the Quantization of LLMs
EMNLP 2024
<
1
2
3
4
5
>