Nathan Lambert
11 papers · 2020–2025 · 8 conferences · across top CS/AI conferences
Achievements
Jump to papers ↓+8 more ↓ Show less ↑
🏃 Academic Marathon (5) 🌉 Interdisciplinary Bridge 🐣 Hot Topic Early Bird 🌍 Conference Polyglot (8) 🐝 Cross-Pollinator (14)
🌍
Conference Polyglot
(8)
🏃
Academic Marathon
(5)
🌈
Renaissance Researcher
(5)
👥
Mega-Team
(50)
🗃️
Keyword Collector
(63)
🚀
Conference Pioneer
💎
Century Club
(11)
⚡
Prolific Year
(5)
Conferences
ACL (3)
NIPS (2)
AISTATS (1)
CVPR (1)
ICLR (1)
ICML (1)
L4DC (1)
NAACL (1)
Top co-authors
Research topics
Keywords
large language model
(3)
reward model
(3)
model-based reinforcement learning
(2)
multi-task learning
(1)
preference learning
(1)
direct preference optimization
(1)
visual question answering
(1)
model evaluation
(1)
language modeling
(1)
preference optimization
(1)
multimodal learning
(1)
image captioning
(1)
corpus construction
(1)
instruction following
(1)
web corpus
(1)
intent classification
(1)
language model alignment
(1)
neural network optimization
(1)
reinforcement learning from human feedback
(1)
multilingual nlp
(1)
Papers
RewardBench: Evaluating Reward Models for Language Modeling
NAACL 2025
Molmo and PixMo: Open Weights and Open Data for State-of-the-Art Vision-Language Models
CVPR 2025
OLMoE: Open Mixture-of-Experts Language Models
ICLR 2025
M-RewardBench: Evaluating Reward Models in Multilingual Settings
ACL 2025
Position: Social Choice Should Guide AI Alignment in Dealing with Diverse Human Feedback
ICML 2024
WildGuard: Open One-stop Moderation Tools for Safety Risks, Jailbreaks, and Refusals of LLMs
NIPS 2024
Unpacking DPO and PPO: Disentangling Best Practices for Learning from Preference Feedback
NIPS 2024
Dolma: an Open Corpus of Three Trillion Tokens for Language Model Pretraining Research
ACL 2024
OLMo: Accelerating the Science of Language Models
ACL 2024
On the Importance of Hyperparameter Optimization for Model-based Reinforcement Learning
AISTATS 2021
Objective Mismatch in Model-based Reinforcement Learning
L4DC 2020