conftrace_

John Aslanides

6 papers · 2017–2022 · 4 conferences · across top CS/AI conferences

Achievements

Jump to papers ↓

+3 more ↓

🐝 Cross-Pollinator (15) 🌍 Conference Polyglot (4) 🏃 Academic Marathon (5) 🌉 Interdisciplinary Bridge 🗺️ Taxonomy Completionist (17)

🧭 Keyword Pioneer 🐣 Hot Topic Early Bird ❓ The Questioner

Conferences

NIPS (3) EMNLP (1) ICLR (1) IJCAI (1)

Top co-authors

Nat McAleese (2) Matteo Hessel (2) Ian Osband (2) Amelia Glaese (2) Albin Cassirer (1) Hado van Hasselt (1) Ethan Perez (1) Roman Ring (1) Trevor Cai (1) Yotam Doron (1)

Keywords

reinforcement learning (2) sequential decision making (1) prompt engineering (1) harmful content (1) trajectory optimization (1) model-based reinforcement learning (1) parametric model (1) partially observable environment (1) ensemble method (1) language model (1) reward model (1) uncertainty estimation (1) safety evaluation (1) credit assignment (1) red teaming (1) harmful content detection (1) human preference (1) experience replay (1) adversarial testing (1) model-based rl (1)

Papers

Fine-tuning language models to find agreement among humans with diverse preferences NIPS 2022 Red Teaming Language Models with Language Models EMNLP 2022 Behaviour Suite for Reinforcement Learning ICLR 2020 When to use parametric models in reinforcement learning? NIPS 2019 Randomized Prior Functions for Deep Reinforcement Learning NIPS 2018 Universal Reinforcement Learning Algorithms: Survey and Experiments IJCAI 2017