Niklas Muennighoff

35 papers · 2022–2025 · 11 conferences · across top CS/AI conferences

Achievements

+8 more ↓

🗺️ Taxonomy Completionist (39) 🧭 Keyword Pioneer 🌉 Interdisciplinary Bridge 🌍 Conference Polyglot (11) 🐣 Hot Topic Early Bird

🐝 Cross-Pollinator (15) 🔬 Deep Specialist (13) 👑 Triple Crown 👥 Mega-Team (82) ❓ The Questioner (2) 🗃️ Keyword Collector (110) 💎 Century Club (35) ⚡ Prolific Year (11)

Conferences

ICLR (12) ACL (7) NIPS (5) EMNLP (4) COLING (1) CVPR (1) EACL (1) ICCV (1) ICML (1) JMLR (1) NAACL (1)

Top co-authors

Stella Biderman (6) Luca Soldaini (6) Dirk Groeneveld (5) Sara Hooker (5) Kyle Lo (5) Hannaneh Hajishirzi (5) Seungone Kim (4) Nouamane Tazi (4) Binyuan Hui (4) Qian Liu (4)

Keywords

large language model (10) multilingual language model (5) language model (4) benchmark evaluation (3) multilingual model (3) cross-lingual transfer (2) zero-shot generalization (2) data curation (2) model scaling (2) language modeling (2) model training (2) dataset collection (2) data repetition (2) scaling law (2) cross-modal retrieval (1) code generation (1) image captioning (1) multimodal learning (1) multilingual nlp (1) continued pretraining (1)

Papers

s1: Simple test-time scaling EMNLP 2025 MIEB: Massive Image Embedding Benchmark ICCV 2025 KMMLU: Measuring Massive Multitask Language Understanding in Korean NAACL 2025 SWE-bench Multimodal: Do AI Systems Generalize to Visual Software Domains? ICLR 2025 Scaling Laws for Precision ICLR 2025 BRIGHT: A Realistic and Challenging Benchmark for Reasoning-Intensive Retrieval ICLR 2025 MMTEB: Massive Multilingual Text Embedding Benchmark ICLR 2025 Bridging the Data Provenance Gap Across Text, Speech, and Video ICLR 2025 OpenHands: An Open Platform for AI Software Developers as Generalist Agents ICLR 2025 BigCodeBench: Benchmarking Code Generation with Diverse Function Calls and Complex Instructions ICLR 2025 Scaling Data-Constrained Language Models JMLR 2025 LLM-as-an-Interviewer: Beyond Static Testing Through Dynamic LLM Evaluation ACL 2025 Aurora-M: Open Source Continual Pre-training for Multilingual Language and Code COLING 2025 Molmo and PixMo: Open Weights and Open Data for State-of-the-Art Vision-Language Models CVPR 2025 OLMoE: Open Mixture-of-Experts Language Models ICLR 2025 Generative Representational Instruction Tuning ICLR 2025 Language models scale reliably with over-training and on downstream tasks ICLR 2025 RegMix: Data Mixture as Regression for Language Model Pre-training ICLR 2025 SEACrowd: A Multilingual Multimodal Data Hub and Benchmark Suite for Southeast Asian Languages EMNLP 2024 DataComp-LM: In search of the next generation of training sets for language models NIPS 2024 The Scandinavian Embedding Benchmarks: Comprehensive Assessment of Multilingual and Monolingual Text Embedding NIPS 2024 Consent in Crisis: The Rapid Decline of the AI Data Commons NIPS 2024 Scaling Laws with Vocabulary: Larger Models Deserve Larger Vocabularies NIPS 2024 Aya Dataset: An Open-Access Collection for Multilingual Instruction Tuning ACL 2024 Dolma: an Open Corpus of Three Trillion Tokens for Language Model Pretraining Research ACL 2024 OLMo: Accelerating the Science of Language Models ACL 2024 Aya Model: An Instruction Finetuned Open-Access Multilingual Language Model ACL 2024 OctoPack: Instruction Tuning Code Large Language Models ICLR 2024 Model Alignment as Prospect Theoretic Optimization ICML 2024 FinGPT: Large Generative Models for a Small Language EMNLP 2023 MTEB: Massive Text Embedding Benchmark EACL 2023 BLOOM+1: Adding Language Support to BLOOM for Zero-Shot Prompting ACL 2023 Crosslingual Generalization through Multitask Finetuning ACL 2023 Scaling Data-Constrained Language Models NIPS 2023 What Language Model to Train if You Have One Million GPU Hours? EMNLP 2022