Tom Everitt
16 papers · 2017–2025 · 6 conferences · across top CS/AI conferences
Achievements
Jump to papers ↓+7 more ↓ Show less ↑
π Interdisciplinary Bridge π Conference Polyglot (6) π Academic Marathon (8) π Cross-Pollinator (13) πΊοΈ Taxonomy Completionist (21)
π§
Keyword Pioneer
π
Conference Polyglot
(6)
π
Grand Slam
π
Keyword Champion
π
Century Club
(16)
ποΈ
Keyword Collector
(66)
π₯
Unstoppable
(5)
Conferences
AAAI (7)
IJCAI (3)
ICML (2)
NIPS (2)
ICLR (1)
UAI (1)
Top co-authors
Keywords
causal inference
(8)
influence diagram
(4)
game theory
(4)
ai safety
(3)
markov decision process
(3)
inverse reinforcement learning
(2)
policy learning
(2)
causal influence diagram
(2)
value of information
(2)
agent incentive
(2)
agent system
(2)
graphical model
(2)
causal model
(2)
public policy
(1)
causal reasoning
(1)
algorithmic fairness
(1)
function approximation
(1)
optimistic exploration
(1)
causal discovery
(1)
maximum entropy
(1)
Papers
General agents need world models
ICML 2025
The Limits of Predicting Agents from Behaviour
ICML 2025
Reasoning about Causality in Games (Abstract Reprint)
AAAI 2024
Measuring Goal-Directedness
NIPS 2024
Robust agents learn causal world models
ICLR 2024
Discovering Agents (Abstract Reprint)
AAAI 2024
Human Control: Definitions and Algorithms
UAI 2023
Honesty Is the Best Policy: Defining and Mitigating AI Deception
NIPS 2023
Path-Specific Objectives for Safer Agent Incentives
AAAI 2022
Why Fair Labels Can Yield Unfair Predictions: Graphical Conditions for Introduced Unfairness
AAAI 2022
A Complete Criterion for Value of Information in Soluble Influence Diagrams
AAAI 2022
Agent Incentives: A Causal Perspective
AAAI 2021
How RL Agents Behave When Their Actions Are Modified
AAAI 2021
AGI Safety Literature Review
IJCAI 2018
Count-Based Exploration in Feature Space for Reinforcement Learning
IJCAI 2017
Reinforcement Learning with a Corrupted Reward Channel
IJCAI 2017