Matthijs T. J. Spaan

15 papers · 2005–2025 · 7 conferences · across top CS/AI conferences

Achievements

+7 more ↓

🌉 Interdisciplinary Bridge 🐣 Hot Topic Early Bird 🌍 Conference Polyglot (7) 🏃 Academic Marathon (20) 🐝 Cross-Pollinator (13)

🌉 Interdisciplinary Bridge 🐣 Hot Topic Early Bird 🧬 Topic Evolution 🗃️ Keyword Collector (53) 💎 Century Club (15) 🔥 Unstoppable (5) 🚀 Conference Pioneer

Conferences

ICML (4) AAAI (3) ICLR (2) IJCAI (2) RSS (2) ACL (1) JMLR (1)

Top co-authors

Thiago D. Simão (5) Yaniv Oren (2) Federico Bianchi (2) Alberto Castellini (2) Alessandro Farinelli (2) Jinke He (2) Wendelin Boehmer (2) Edoardo Zorzi (2) Stefan John Witwicki (1) Sudarshanan Bharadwaj (1)

Keywords

reinforcement learning (3) safe policy improvement (3) sample complexity (2) sample efficiency (1) reinforcement learning theory (1) uncertainty quantification (1) structure learning (1) sequential decision making (1) bayesian regret (1) policy learning (1) distributional reinforcement learning (1) partially observable markov decision process (1) policy iteration (1) partially observable stochastic game (1) worst-case optimization (1) policy improvement (1) scalable training (1) monte carlo tree search (1) epistemic uncertainty (1) offline reinforcement learning (1)

Papers

Trust-Region Twisted Policy Improvement ICML 2025 Epistemic Monte Carlo Tree Search ICLR 2025 Epistemic Bellman Operators AAAI 2025 Positive Experience Reflection for Agents in Interactive Text Environments ACL 2025 Scalable Safe Policy Improvement for Factored Multi-Agent MDPs ICML 2024 Diverse Projection Ensembles for Distributional Reinforcement Learning ICLR 2024 Scalable Safe Policy Improvement via Monte Carlo Tree Search ICML 2023 Influence-Augmented Local Simulators: a Scalable Solution for Fast Deep RL in Large Networked Systems ICML 2022 Safe Policies for Factored Partially Observable Stochastic Games RSS 2021 WCSAC: Worst-Case Soft Actor Critic for Safety-Constrained Reinforcement Learning AAAI 2021 Structure Learning for Safe Policy Improvement IJCAI 2019 Safe Policy Improvement with Baseline Bootstrapping in Factored Environments AAAI 2019 The MADP Toolbox: An Open Source Library for Planning and Learning in (Multi-)Agent Systems JMLR 2017 Factored Upper Bounds for Multiagent Planning Problems under Uncertainty with Non-Factored Value Functions IJCAI 2015 Robot Planning in Partially Observable Continuous Domains RSS 2005