Nathan Lambert

11 papers · 2020–2025 · 8 conferences · across top CS/AI conferences

Achievements

+8 more ↓

🏃 Academic Marathon (5) 🌉 Interdisciplinary Bridge 🐣 Hot Topic Early Bird 🌍 Conference Polyglot (8) 🐝 Cross-Pollinator (14)

🌍 Conference Polyglot (8) 🏃 Academic Marathon (5) 🌈 Renaissance Researcher (5) 👥 Mega-Team (50) 🗃️ Keyword Collector (63) 🚀 Conference Pioneer 💎 Century Club (11) ⚡ Prolific Year (5)

Conferences

ACL (3) NIPS (2) AISTATS (1) CVPR (1) ICLR (1) ICML (1) L4DC (1) NAACL (1)

Top co-authors

Hannaneh Hajishirzi (6) Dirk Groeneveld (4) Kyle Lo (4) Noah A. Smith (4) Luca Soldaini (4) Jacob Morrison (4) Niklas Muennighoff (4) Yejin Choi (3) Akshita Bhagia (3) Oyvind Tafjord (3)

Research topics

Reinforcement Learning (2)

Keywords

large language model (3) reward model (3) model-based reinforcement learning (2) multi-task learning (1) preference learning (1) direct preference optimization (1) visual question answering (1) model evaluation (1) language modeling (1) preference optimization (1) multimodal learning (1) image captioning (1) corpus construction (1) instruction following (1) web corpus (1) intent classification (1) language model alignment (1) neural network optimization (1) reinforcement learning from human feedback (1) multilingual nlp (1)

Papers

RewardBench: Evaluating Reward Models for Language Modeling NAACL 2025 Molmo and PixMo: Open Weights and Open Data for State-of-the-Art Vision-Language Models CVPR 2025 OLMoE: Open Mixture-of-Experts Language Models ICLR 2025 M-RewardBench: Evaluating Reward Models in Multilingual Settings ACL 2025 Position: Social Choice Should Guide AI Alignment in Dealing with Diverse Human Feedback ICML 2024 WildGuard: Open One-stop Moderation Tools for Safety Risks, Jailbreaks, and Refusals of LLMs NIPS 2024 Unpacking DPO and PPO: Disentangling Best Practices for Learning from Preference Feedback NIPS 2024 Dolma: an Open Corpus of Three Trillion Tokens for Language Model Pretraining Research ACL 2024 OLMo: Accelerating the Science of Language Models ACL 2024 On the Importance of Hyperparameter Optimization for Model-based Reinforcement Learning AISTATS 2021 Objective Mismatch in Model-based Reinforcement Learning L4DC 2020