conftrace_

Tom Everitt

16 papers · 2017–2025 · 6 conferences · across top CS/AI conferences

Achievements

Jump to papers ↓

+7 more ↓

🌉 Interdisciplinary Bridge 🌍 Conference Polyglot (6) 🏃 Academic Marathon (8) 🐝 Cross-Pollinator (13) 🗺️ Taxonomy Completionist (21)

🧭 Keyword Pioneer 🌍 Conference Polyglot (6) 🏆 Grand Slam 🏆 Keyword Champion 💎 Century Club (16) 🗃️ Keyword Collector (66) 🔥 Unstoppable (5)

Conferences

AAAI (7) IJCAI (3) ICML (2) NIPS (2) ICLR (1) UAI (1)

Top co-authors

Ryan Carey (6) Jonathan Richens (4) Eric D. Langlois (2) Shane Legg (2) Matt MacDermott (2) Francesco Belardinelli (2) Sebastian Farquhar (2) Marcus Hutter (2) James Fox (2) Francis Ward (1)

Keywords

causal inference (8) influence diagram (4) game theory (4) ai safety (3) markov decision process (3) inverse reinforcement learning (2) policy learning (2) causal influence diagram (2) value of information (2) agent incentive (2) agent system (2) graphical model (2) causal model (2) public policy (1) causal reasoning (1) algorithmic fairness (1) function approximation (1) optimistic exploration (1) causal discovery (1) maximum entropy (1)

Papers

General agents need world models ICML 2025 The Limits of Predicting Agents from Behaviour ICML 2025 Reasoning about Causality in Games (Abstract Reprint) AAAI 2024 Measuring Goal-Directedness NIPS 2024 Robust agents learn causal world models ICLR 2024 Discovering Agents (Abstract Reprint) AAAI 2024 Human Control: Definitions and Algorithms UAI 2023 Honesty Is the Best Policy: Defining and Mitigating AI Deception NIPS 2023 Path-Specific Objectives for Safer Agent Incentives AAAI 2022 Why Fair Labels Can Yield Unfair Predictions: Graphical Conditions for Introduced Unfairness AAAI 2022 A Complete Criterion for Value of Information in Soluble Influence Diagrams AAAI 2022 Agent Incentives: A Causal Perspective AAAI 2021 How RL Agents Behave When Their Actions Are Modified AAAI 2021 AGI Safety Literature Review IJCAI 2018 Count-Based Exploration in Feature Space for Reinforcement Learning IJCAI 2017 Reinforcement Learning with a Corrupted Reward Channel IJCAI 2017