Torsten Hoefler
21 papers · 2018–2026 · 11 conferences · across top CS/AI conferences
Achievements
Jump to papers ↓+10 more ↓ Show less ↑
π Interdisciplinary Bridge π Academic Marathon (7) π Renaissance Researcher (6) π Conference Polyglot (10) πΊοΈ Taxonomy Completionist (33)
πΊοΈ
Taxonomy Completionist
(33)
π§
Keyword Pioneer
π£
Hot Topic Early Bird
π
Triple Crown
π
Grand Slam
ποΈ
Keyword Collector
(59)
β‘
Prolific Year
(8)
π
Trend Setter
π₯
Unstoppable
(6)
π
Century Club
(20)
Conferences
ICLR (5)
NIPS (5)
ICML (2)
NSDI (2)
AAAI (1)
ACL (1)
AISTATS (1)
CVPR (1)
EMNLP (1)
ICCV (1)
JMLR (1)
Top co-authors
Keywords
model compression
(4)
large language model
(3)
stochastic gradient descent
(2)
neural network pruning
(2)
distributed training
(1)
semantic analysis
(1)
non-convex optimization
(1)
edge deployment
(1)
efficient inference
(1)
graph representation
(1)
semantic representation
(1)
weather prediction
(1)
program analysis
(1)
spatial datum
(1)
ensemble forecast
(1)
mixture of expert
(1)
inference optimization
(1)
inference efficiency
(1)
distributed learning
(1)
representation learning
(1)
Papers
Apertus: Democratizing Open and Compliant LLMs for Global Language Environments
ACL 2026
All models are wrong, some are useful: Model Selection with Limited Labels
AISTATS 2025
DiffDA: a Diffusion model for weather-scale Data Assimilation
ICML 2024
QuaRot: Outlier-Free 4-Bit Inference in Rotated LLMs
NIPS 2024
Graph of Thoughts: Solving Elaborate Problems with Large Language Models
AAAI 2024
Swing: Short-cutting Rings for Higher Bandwidth Allreduce
NSDI 2024
QUIK: Towards End-to-end 4-Bit Inference on Generative Large Language Models
EMNLP 2024
SliceGPT: Compress Large Language Models by Deleting Rows and Columns
ICLR 2024
SpQR: A Sparse-Quantized Representation for Near-Lossless LLM Weight Compression
ICLR 2024
A High-Performance Design, Implementation, Deployment, and Evaluation of The Slim Fly Network
NSDI 2024
Compressing multidimensional weather and climate data into neural networks
ICLR 2023
OPTQ: Accurate Quantization for Generative Pre-trained Transformers
ICLR 2023
Differentiable Transportation Pruning
ICCV 2023
Neural Parameter Allocation Search
ICLR 2022
Spatial Mixture-of-Experts
NIPS 2022
ENS-10: A Dataset For Post-Processing Ensemble Weather Forecasts
NIPS 2022
ProGraML: A Graph-based Program Representation for Data Flow Analysis and Compiler Optimizations
ICML 2021
Sparsity in Deep Learning: Pruning and growth for efficient inference and training in neural networks
JMLR 2021
Augment Your Batch: Improving Generalization Through Instance Repetition
CVPR 2020
Neural Code Comprehension: A Learnable Representation of Code Semantics
NIPS 2018
The Convergence of Sparsified Gradient Methods
NIPS 2018