John Aslanides
6 papers · 2017–2022 · 4 conferences · across top CS/AI conferences
Achievements
Jump to papers ↓+3 more ↓ Show less ↑
π Cross-Pollinator (15) π Conference Polyglot (4) π Academic Marathon (5) π Interdisciplinary Bridge πΊοΈ Taxonomy Completionist (17)
π§
Keyword Pioneer
π£
Hot Topic Early Bird
β
The Questioner
Conferences
NIPS (3)
EMNLP (1)
ICLR (1)
IJCAI (1)
Top co-authors
Keywords
reinforcement learning
(2)
sequential decision making
(1)
prompt engineering
(1)
harmful content
(1)
trajectory optimization
(1)
model-based reinforcement learning
(1)
parametric model
(1)
partially observable environment
(1)
ensemble method
(1)
language model
(1)
reward model
(1)
uncertainty estimation
(1)
safety evaluation
(1)
credit assignment
(1)
red teaming
(1)
harmful content detection
(1)
human preference
(1)
experience replay
(1)
adversarial testing
(1)
model-based rl
(1)
Papers
Fine-tuning language models to find agreement among humans with diverse preferences
NIPS 2022
Red Teaming Language Models with Language Models
EMNLP 2022
Behaviour Suite for Reinforcement Learning
ICLR 2020
When to use parametric models in reinforcement learning?
NIPS 2019
Randomized Prior Functions for Deep Reinforcement Learning
NIPS 2018
Universal Reinforcement Learning Algorithms: Survey and Experiments
IJCAI 2017