Saleh Ashkboos
7 papers · 2021–2024 · 3 conferences · across top CS/AI conferences
Achievements
Jump to papers ↓
π
Conference Polyglot
(3)
π
Interdisciplinary Bridge
πΊοΈ
Taxonomy Completionist
(11)
π§
Keyword Pioneer
π
Cross-Pollinator
(15)
Conferences
ICLR (4)
NIPS (2)
EMNLP (1)
Top co-authors
Keywords
large language model
(2)
model compression
(2)
ensemble forecast
(1)
inference optimization
(1)
inference efficiency
(1)
weight quantization
(1)
extreme event prediction
(1)
numerical weather prediction
(1)
4-bit quantization
(1)
activation quantization
(1)
gpu kernel
(1)
rotary transformation
(1)
4-bit inference
(1)
model quantization
(1)
Papers
SpQR: A Sparse-Quantized Representation for Near-Lossless LLM Weight Compression
ICLR 2024
QuaRot: Outlier-Free 4-Bit Inference in Rotated LLMs
NIPS 2024
QUIK: Towards End-to-end 4-Bit Inference on Generative Large Language Models
EMNLP 2024
SliceGPT: Compress Large Language Models by Deleting Rows and Columns
ICLR 2024
OPTQ: Accurate Quantization for Generative Pre-trained Transformers
ICLR 2023
ENS-10: A Dataset For Post-Processing Ensemble Weather Forecasts
NIPS 2022
New Bounds For Distributed Mean Estimation and Variance Reduction
ICLR 2021