Stephen Mcaleer
14 papers · 2019–2024 · 5 conferences · across top CS/AI conferences
Achievements
Jump to papers ↓+7 more ↓ Show less ↑
🏃 Academic Marathon (5) 🧭 Keyword Pioneer 🐣 Hot Topic Early Bird 🌍 Conference Polyglot (5) 🐝 Cross-Pollinator (9)
🏃
Academic Marathon
(5)
🐣
Hot Topic Early Bird
🐝
Cross-Pollinator
(9)
🏆
Grand Slam
🗃️
Keyword Collector
(52)
💎
Century Club
(14)
🔥
Unstoppable
(6)
Conferences
NIPS (8)
ICML (2)
IJCAI (2)
AAAI (1)
ICLR (1)
Top co-authors
Keywords
game theory
(7)
nash equilibrium
(4)
multi-agent reinforcement learning
(3)
mechanism design
(3)
reinforcement learning
(3)
policy optimization
(2)
extensive-form game
(2)
double oracle
(2)
deep reinforcement learning
(2)
temporal-difference learning
(1)
multi-agent learning
(1)
markov decision process
(1)
autonomous vehicle
(1)
value estimation
(1)
reinforcement learning from human feedback
(1)
ensemble method
(1)
zero-sum game
(1)
bilevel optimization
(1)
prompt engineering
(1)
offline reinforcement learning
(1)
Papers
Policy Space Response Oracles: A Survey
IJCAI 2024
Scalable Mechanism Design for Multi-Agent Path Finding
IJCAI 2024
Automated Design of Affine Maximizer Mechanisms in Dynamic Settings
AAAI 2024
Team-PSRO for Learning Approximate TMECor in Large Team Games via Cooperative Reinforcement Learning
NIPS 2023
Policy Space Diversity for Non-Transitive Games
NIPS 2023
Computing Optimal Equilibria and Mechanisms via Learning in Zero-Sum Extensive-Form Games
NIPS 2023
Language Models can Solve Computer Tasks
NIPS 2023
Reducing Variance in Temporal-Difference Value Estimation via Ensemble of Deep Networks
ICML 2022
Towards Human-Level Bimanual Dexterous Manipulation with Reinforcement Learning
NIPS 2022
XDO: A Double Oracle Algorithm for Extensive-Form Games
NIPS 2021
Neural Auto-Curricula in Two-Player Zero-Sum Games
NIPS 2021
Evolutionary Reinforcement Learning for Sample-Efficient Multiagent Coordination
ICML 2020
Pipeline PSRO: A Scalable Approach for Finding Approximate Nash Equilibria in Large Games
NIPS 2020
Solving the Rubik's Cube with Approximate Policy Iteration
ICLR 2019