conftrace_

Jonah Brown-Cohen

5 papers · 2021–2024 · 3 conferences · across top CS/AI conferences

Achievements

Jump to papers ↓

+2 more ↓

🐝 Cross-Pollinator (4) 🧭 Keyword Pioneer 🌍 Conference Polyglot (3) 🐣 Hot Topic Early Bird 🌉 Interdisciplinary Bridge

🗺️ Taxonomy Completionist (10) 👑 Triple Crown

Conferences

ICML (2) NIPS (2) ICLR (1)

Top co-authors

Zachary Kenton (1) Ezgi Korkmaz (1) Arushi Gupta (1) János Kramár (1) Anirudh Goyal (1) Rohin Shah (1) Yunhao Tang (1) Noah D. Goodman (1) Jannis Bulian (1) Georgios Piliouras (1)

Keywords

deep reinforcement learning (1) policy optimization (1) semidefinite programming (1) model evaluation (1) ai safety (1) state representation (1) statistical estimation (1) online convex optimization (1) adversarial attack (1) scalable oversight (1) weak-to-strong generalization (1) multi-agent debate (1) human supervision (1) eigenvector computation (1) large language model (1) policy robustness (1) neural network (1) multi-agent system (1) robust decision making (1) debate protocol (1)

Papers

On scalable oversight with weak LLMs judging strong LLMs NIPS 2024 SKILL-MIX: a Flexible and Expandable Family of Evaluations for AI Models ICLR 2024 Scalable AI Safety via Doubly-Efficient Debate ICML 2024 Detecting Adversarial Directions in Deep Reinforcement Learning to Make Robust Decisions ICML 2023 Faster Algorithms and Constant Lower Bounds for the Worst-Case Expected Error NIPS 2021