Matthijs T. J. Spaan
15 papers · 2005–2025 · 7 conferences · across top CS/AI conferences
Achievements
Jump to papers ↓+7 more ↓ Show less ↑
🌉 Interdisciplinary Bridge 🐣 Hot Topic Early Bird 🌍 Conference Polyglot (7) 🏃 Academic Marathon (20) 🐝 Cross-Pollinator (13)
🌉
Interdisciplinary Bridge
🐣
Hot Topic Early Bird
🧬
Topic Evolution
🗃️
Keyword Collector
(53)
💎
Century Club
(15)
🔥
Unstoppable
(5)
🚀
Conference Pioneer
Conferences
ICML (4)
AAAI (3)
ICLR (2)
IJCAI (2)
RSS (2)
ACL (1)
JMLR (1)
Top co-authors
Keywords
reinforcement learning
(3)
safe policy improvement
(3)
sample complexity
(2)
sample efficiency
(1)
reinforcement learning theory
(1)
uncertainty quantification
(1)
structure learning
(1)
sequential decision making
(1)
bayesian regret
(1)
policy learning
(1)
distributional reinforcement learning
(1)
partially observable markov decision process
(1)
policy iteration
(1)
partially observable stochastic game
(1)
worst-case optimization
(1)
policy improvement
(1)
scalable training
(1)
monte carlo tree search
(1)
epistemic uncertainty
(1)
offline reinforcement learning
(1)
Papers
Trust-Region Twisted Policy Improvement
ICML 2025
Epistemic Monte Carlo Tree Search
ICLR 2025
Epistemic Bellman Operators
AAAI 2025
Positive Experience Reflection for Agents in Interactive Text Environments
ACL 2025
Scalable Safe Policy Improvement for Factored Multi-Agent MDPs
ICML 2024
Diverse Projection Ensembles for Distributional Reinforcement Learning
ICLR 2024
Scalable Safe Policy Improvement via Monte Carlo Tree Search
ICML 2023
Influence-Augmented Local Simulators: a Scalable Solution for Fast Deep RL in Large Networked Systems
ICML 2022
Safe Policies for Factored Partially Observable Stochastic Games
RSS 2021
WCSAC: Worst-Case Soft Actor Critic for Safety-Constrained Reinforcement Learning
AAAI 2021
Structure Learning for Safe Policy Improvement
IJCAI 2019
Safe Policy Improvement with Baseline Bootstrapping in Factored Environments
AAAI 2019
The MADP Toolbox: An Open Source Library for Planning and Learning in (Multi-)Agent Systems
JMLR 2017
Factored Upper Bounds for Multiagent Planning Problems under Uncertainty with Non-Factored Value Functions
IJCAI 2015
Robot Planning in Partially Observable Continuous Domains
RSS 2005