conftrace_

Niklas Muennighoff

35 papers · 2022–2025 · 11 conferences · across top CS/AI conferences

Achievements

Jump to papers ↓
+8 more ↓ πŸ—ΊοΈ Taxonomy Completionist (39) 🧭 Keyword Pioneer πŸŒ‰ Interdisciplinary Bridge 🌍 Conference Polyglot (11) 🐣 Hot Topic Early Bird
🐝 Cross-Pollinator (15) πŸ”¬ Deep Specialist (13) πŸ‘‘ Triple Crown πŸ‘₯ Mega-Team (82) ❓ The Questioner (2) πŸ—ƒοΈ Keyword Collector (110) πŸ’Ž Century Club (35) ⚑ Prolific Year (11)

Conferences

ICLR (12) ACL (7) NIPS (5) EMNLP (4) COLING (1) CVPR (1) EACL (1) ICCV (1) ICML (1) JMLR (1) NAACL (1)

Papers

s1: Simple test-time scaling EMNLP 2025 MIEB: Massive Image Embedding Benchmark ICCV 2025 KMMLU: Measuring Massive Multitask Language Understanding in Korean NAACL 2025 SWE-bench Multimodal: Do AI Systems Generalize to Visual Software Domains? ICLR 2025 Scaling Laws for Precision ICLR 2025 BRIGHT: A Realistic and Challenging Benchmark for Reasoning-Intensive Retrieval ICLR 2025 MMTEB: Massive Multilingual Text Embedding Benchmark ICLR 2025 Bridging the Data Provenance Gap Across Text, Speech, and Video ICLR 2025 OpenHands: An Open Platform for AI Software Developers as Generalist Agents ICLR 2025 BigCodeBench: Benchmarking Code Generation with Diverse Function Calls and Complex Instructions ICLR 2025 Scaling Data-Constrained Language Models JMLR 2025 LLM-as-an-Interviewer: Beyond Static Testing Through Dynamic LLM Evaluation ACL 2025 Aurora-M: Open Source Continual Pre-training for Multilingual Language and Code COLING 2025 Molmo and PixMo: Open Weights and Open Data for State-of-the-Art Vision-Language Models CVPR 2025 OLMoE: Open Mixture-of-Experts Language Models ICLR 2025 Generative Representational Instruction Tuning ICLR 2025 Language models scale reliably with over-training and on downstream tasks ICLR 2025 RegMix: Data Mixture as Regression for Language Model Pre-training ICLR 2025 SEACrowd: A Multilingual Multimodal Data Hub and Benchmark Suite for Southeast Asian Languages EMNLP 2024 DataComp-LM: In search of the next generation of training sets for language models NIPS 2024 The Scandinavian Embedding Benchmarks: Comprehensive Assessment of Multilingual and Monolingual Text Embedding NIPS 2024 Consent in Crisis: The Rapid Decline of the AI Data Commons NIPS 2024 Scaling Laws with Vocabulary: Larger Models Deserve Larger Vocabularies NIPS 2024 Aya Dataset: An Open-Access Collection for Multilingual Instruction Tuning ACL 2024 Dolma: an Open Corpus of Three Trillion Tokens for Language Model Pretraining Research ACL 2024 OLMo: Accelerating the Science of Language Models ACL 2024 Aya Model: An Instruction Finetuned Open-Access Multilingual Language Model ACL 2024 OctoPack: Instruction Tuning Code Large Language Models ICLR 2024 Model Alignment as Prospect Theoretic Optimization ICML 2024 FinGPT: Large Generative Models for a Small Language EMNLP 2023 MTEB: Massive Text Embedding Benchmark EACL 2023 BLOOM+1: Adding Language Support to BLOOM for Zero-Shot Prompting ACL 2023 Crosslingual Generalization through Multitask Finetuning ACL 2023 Scaling Data-Constrained Language Models NIPS 2023 What Language Model to Train if You Have One Million GPU Hours? EMNLP 2022