Torsten Hoefler

21 papers · 2018–2026 · 11 conferences · across top CS/AI conferences

Achievements

🌉 Interdisciplinary Bridge 🏃 Academic Marathon (7) 🌈 Renaissance Researcher (6) 🌍 Conference Polyglot (10) 🗺️ Taxonomy Completionist (33) 🧭 Keyword Pioneer 🐣 Hot Topic Early Bird 👑 Triple Crown 🏆 Grand Slam 🗃️ Keyword Collector (59) ⚡ Prolific Year (8) 📈 Trend Setter 🔥 Unstoppable (6) 💎 Century Club (20)

Conferences

ICLR (5) NIPS (5) ICML (2) NSDI (2) AAAI (1) ACL (1) AISTATS (1) CVPR (1) EMNLP (1) ICCV (1) JMLR (1)

Top co-authors

Dan Alistarh (6) Saleh Ashkboos (6) Tal Ben-Nun (5) Nikoli Dryden (4) Lukas Gianinazzi (3) Elias Frantar (3) Langwen Huang (3) Nils Blach (2) Daniele De Sensi (2) Maciej Besta (2)

Keywords

model compression (4) large language model (3) stochastic gradient descent (2) neural network pruning (2) distributed training (1) semantic analysis (1) non-convex optimization (1) edge deployment (1) efficient inference (1) graph representation (1) semantic representation (1) weather prediction (1) program analysis (1) spatial data (1) ensemble forecast (1) mixture of experts (1) inference optimization (1) inference efficiency (1) distributed learning (1) representation learning (1)

Papers

Apertus: Democratizing Open and Compliant LLMs for Global Language Environments · ACL 2026
All models are wrong, some are useful: Model Selection with Limited Labels · AISTATS 2025
DiffDA: a Diffusion model for weather-scale Data Assimilation · ICML 2024
QuaRot: Outlier-Free 4-Bit Inference in Rotated LLMs · NIPS 2024
Graph of Thoughts: Solving Elaborate Problems with Large Language Models · AAAI 2024
Swing: Short-cutting Rings for Higher Bandwidth Allreduce · NSDI 2024
QUIK: Towards End-to-end 4-Bit Inference on Generative Large Language Models · EMNLP 2024
SliceGPT: Compress Large Language Models by Deleting Rows and Columns · ICLR 2024
SpQR: A Sparse-Quantized Representation for Near-Lossless LLM Weight Compression · ICLR 2024
A High-Performance Design, Implementation, Deployment, and Evaluation of The Slim Fly Network · NSDI 2024
Compressing multidimensional weather and climate data into neural networks · ICLR 2023
OPTQ: Accurate Quantization for Generative Pre-trained Transformers · ICLR 2023
Differentiable Transportation Pruning · ICCV 2023
Neural Parameter Allocation Search · ICLR 2022
Spatial Mixture-of-Experts · NIPS 2022
ENS-10: A Dataset For Post-Processing Ensemble Weather Forecasts · NIPS 2022
ProGraML: A Graph-based Program Representation for Data Flow Analysis and Compiler Optimizations · ICML 2021
Sparsity in Deep Learning: Pruning and growth for efficient inference and training in neural networks · JMLR 2021
Augment Your Batch: Improving Generalization Through Instance Repetition · CVPR 2020
Neural Code Comprehension: A Learnable Representation of Code Semantics · NIPS 2018
The Convergence of Sparsified Gradient Methods · NIPS 2018