Tim Dettmers

17 papers · 2018–2025 · 5 conferences · across top CS/AI conferences

Achievements

+9 more ↓

🏃 Academic Marathon (7) 🧭 Keyword Pioneer 🌉 Interdisciplinary Bridge 🌍 Conference Polyglot (5) 🐝 Cross-Pollinator (11)

🏃 Academic Marathon (7) 🗺️ Taxonomy Completionist (31) 🌈 Renaissance Researcher (5) 👥 Mega-Team (24) 🧬 Topic Evolution 💎 Century Club (17) 🔥 Unstoppable (6) 🗃️ Keyword Collector (75) ⚡ Prolific Year (7)

Conferences

NIPS (6) ICLR (4) ICML (3) ACL (2) EMNLP (2)

Top co-authors

Luke Zettlemoyer (7) Alexander Borzunov (4) Younes Belkada (3) Mike Lewis (3) Ali Farhadi (3) Dmitry Baranchuk (2) Sewon Min (2) Max Ryabinin (2) Weijia Shi (2) Jacob Morrison (2)

Keywords

large language model (6) model compression (4) efficient computing (3) distributed learning (2) knowledge distillation (2) model quantization (2) embedding space (1) neural network training (1) transformer architecture (1) information retrieval (1) language modeling (1) distributed computing (1) network pruning (1) natural language understanding (1) question answering (1) model parallelism (1) neural network optimization (1) machine reading comprehension (1) natural language inference (1) link prediction (1)

Papers

Holistically Evaluating the Environmental Impact of Creating Language Models ICLR 2025 OLMoE: Open Mixture-of-Experts Language Models ICLR 2025 SpQR: A Sparse-Quantized Representation for Near-Lossless LLM Weight Compression ICLR 2024 Scaling Retrieval-Based Language Models with a Trillion-Token Datastore NIPS 2024 MatFormer: Nested Transformer for Elastic Inference NIPS 2024 Stable and low-precision training for large-scale vision-language models NIPS 2023 SWARM Parallelism: Training Large Models Can Be Surprisingly Communication-Efficient ICML 2023 Petals: Collaborative Inference and Fine-tuning of Large Models ACL 2023 The case for 4-bit precision: k-bit Inference Scaling Laws ICML 2023 Towards A Unified View of Sparse Feed-Forward Network in Pretraining Large Language Model EMNLP 2023 Distributed Inference and Fine-tuning of Large Language Models Over The Internet NIPS 2023 QLoRA: Efficient Finetuning of Quantized LLMs NIPS 2023 GPT3.int8(): 8-bit Matrix Multiplication for Transformers at Scale NIPS 2022 8-bit Optimizers via Block-wise Quantization ICLR 2022 BASE Layers: Simplifying Training of Large, Sparse Models ICML 2021 High Performance Natural Language Processing EMNLP 2020 Jack the Reader – A Machine Reading Framework ACL 2018