Niklas Muennighoff
35 papers · 2022–2025 · across 11 top CS/AI conferences
Achievements
Taxonomy Completionist (39)
Keyword Pioneer
Interdisciplinary Bridge
Conference Polyglot (11)
Hot Topic Early Bird
Cross-Pollinator (15)
Deep Specialist (13)
Triple Crown
Mega-Team (82)
The Questioner (2)
Keyword Collector (110)
Century Club (35)
Prolific Year (11)
Conferences
ICLR (12)
ACL (7)
NIPS (5)
EMNLP (4)
COLING (1)
CVPR (1)
EACL (1)
ICCV (1)
ICML (1)
JMLR (1)
NAACL (1)
Keywords
large language model (10)
multilingual language model (5)
language model (4)
benchmark evaluation (3)
multilingual model (3)
cross-lingual transfer (2)
zero-shot generalization (2)
data curation (2)
model scaling (2)
language modeling (2)
model training (2)
dataset collection (2)
data repetition (2)
scaling law (2)
cross-modal retrieval (1)
code generation (1)
image captioning (1)
multimodal learning (1)
multilingual nlp (1)
continued pretraining (1)
Papers
s1: Simple test-time scaling
EMNLP 2025
MIEB: Massive Image Embedding Benchmark
ICCV 2025
KMMLU: Measuring Massive Multitask Language Understanding in Korean
NAACL 2025
SWE-bench Multimodal: Do AI Systems Generalize to Visual Software Domains?
ICLR 2025
Scaling Laws for Precision
ICLR 2025
BRIGHT: A Realistic and Challenging Benchmark for Reasoning-Intensive Retrieval
ICLR 2025
MMTEB: Massive Multilingual Text Embedding Benchmark
ICLR 2025
Bridging the Data Provenance Gap Across Text, Speech, and Video
ICLR 2025
OpenHands: An Open Platform for AI Software Developers as Generalist Agents
ICLR 2025
BigCodeBench: Benchmarking Code Generation with Diverse Function Calls and Complex Instructions
ICLR 2025
Scaling Data-Constrained Language Models
JMLR 2025
LLM-as-an-Interviewer: Beyond Static Testing Through Dynamic LLM Evaluation
ACL 2025
Aurora-M: Open Source Continual Pre-training for Multilingual Language and Code
COLING 2025
Molmo and PixMo: Open Weights and Open Data for State-of-the-Art Vision-Language Models
CVPR 2025
OLMoE: Open Mixture-of-Experts Language Models
ICLR 2025
Generative Representational Instruction Tuning
ICLR 2025
Language models scale reliably with over-training and on downstream tasks
ICLR 2025
RegMix: Data Mixture as Regression for Language Model Pre-training
ICLR 2025
SEACrowd: A Multilingual Multimodal Data Hub and Benchmark Suite for Southeast Asian Languages
EMNLP 2024
DataComp-LM: In search of the next generation of training sets for language models
NIPS 2024
The Scandinavian Embedding Benchmarks: Comprehensive Assessment of Multilingual and Monolingual Text Embedding
NIPS 2024
Consent in Crisis: The Rapid Decline of the AI Data Commons
NIPS 2024
Scaling Laws with Vocabulary: Larger Models Deserve Larger Vocabularies
NIPS 2024
Aya Dataset: An Open-Access Collection for Multilingual Instruction Tuning
ACL 2024
Dolma: an Open Corpus of Three Trillion Tokens for Language Model Pretraining Research
ACL 2024
OLMo: Accelerating the Science of Language Models
ACL 2024
Aya Model: An Instruction Finetuned Open-Access Multilingual Language Model
ACL 2024
OctoPack: Instruction Tuning Code Large Language Models
ICLR 2024
Model Alignment as Prospect Theoretic Optimization
ICML 2024
FinGPT: Large Generative Models for a Small Language
EMNLP 2023
MTEB: Massive Text Embedding Benchmark
EACL 2023
BLOOM+1: Adding Language Support to BLOOM for Zero-Shot Prompting
ACL 2023
Crosslingual Generalization through Multitask Finetuning
ACL 2023
Scaling Data-Constrained Language Models
NIPS 2023
What Language Model to Train if You Have One Million GPU Hours?
EMNLP 2022