Jonah Brown-Cohen
5 papers · 2021–2024 · 3 conferences · across top CS/AI conferences
Achievements
Jump to papers ↓+2 more ↓ Show less ↑
🐝 Cross-Pollinator (4) 🧭 Keyword Pioneer 🌍 Conference Polyglot (3) 🐣 Hot Topic Early Bird 🌉 Interdisciplinary Bridge
🗺️
Taxonomy Completionist
(10)
👑
Triple Crown
Conferences
ICML (2)
NIPS (2)
ICLR (1)
Top co-authors
Keywords
deep reinforcement learning
(1)
policy optimization
(1)
semidefinite programming
(1)
model evaluation
(1)
ai safety
(1)
state representation
(1)
statistical estimation
(1)
online convex optimization
(1)
adversarial attack
(1)
scalable oversight
(1)
weak-to-strong generalization
(1)
multi-agent debate
(1)
human supervision
(1)
eigenvector computation
(1)
large language model
(1)
policy robustness
(1)
neural network
(1)
multi-agent system
(1)
robust decision making
(1)
debate protocol
(1)
Papers
On scalable oversight with weak LLMs judging strong LLMs
NIPS 2024
SKILL-MIX: a Flexible and Expandable Family of Evaluations for AI Models
ICLR 2024
Scalable AI Safety via Doubly-Efficient Debate
ICML 2024
Detecting Adversarial Directions in Deep Reinforcement Learning to Make Robust Decisions
ICML 2023
Faster Algorithms and Constant Lower Bounds for the Worst-Case Expected Error
NIPS 2021