Peter Stone
83 papers · 2006–2026 · 11 conferences · across top CS/AI conferences
Achievements
Keyword Pioneer
Interdisciplinary Bridge
Taxonomy Completionist (20)
Renaissance Researcher (5)
Hot Topic Early Bird
Conference Polyglot (11)
Topic Evolution
Topic Pioneer
Keyword Champion (3)
Grand Slam
Deep Specialist (18)
Dynamic Duo (10)
Trend Setter
Conference Pioneer
Unstoppable (11)
Keyword Collector (60)
Century Club (81)
Prolific Year (12)
Conferences
AAAI (16)
IJCAI (15)
NIPS (15)
CORL (12)
ICML (8)
ICLR (5)
JMLR (5)
CVPR (2)
EMNLP (2)
RSS (2)
UAI (1)
Keywords
reinforcement learning (13)
policy learning (8)
deep reinforcement learning (6)
transfer learning (6)
robot manipulation (5)
imitation learning (5)
ad hoc teamwork (5)
mean squared error (4)
markov decision process (4)
sample efficiency (4)
behavior policy (3)
reward function (3)
goal-conditioned reinforcement learning (3)
out-of-distribution generalization (3)
human-robot interaction (3)
off-policy evaluation (3)
multi-task learning (3)
temporal difference learning (3)
multi-agent reinforcement learning (3)
domain generalization (3)
Papers
Out-of-Distribution Generalization with a SPARC: Racing 100 Unseen Vehicles with a Single Policy
AAAI 2026
The Essentials of AI for Life and Society: A Full-Scale AI Literacy Course Accessible to All
AAAI 2026
Hyperspherical Normalization for Scalable Deep Reinforcement Learning
ICML 2025
SLAC: Simulation-Pretrained Latent Action Space for Whole-Body Real-World RL
CORL 2025
MEReQ: Max-Ent Residual-Q Inverse RL for Sample-Efficient Alignment from Intervention
CORL 2025
SocialNav-SUB: Benchmarking VLMs for Scene Understanding in Social Robot Navigation
CORL 2025
Learning a Fast Mixing Exogenous Block MDP using a Single Trajectory
ICLR 2025
SimBa: Simplicity Bias for Scaling Up Parameters in Deep Reinforcement Learning
ICLR 2025
Longhorn: State Space Models are Amortized Online Learners
ICLR 2025
Proto Successor Measure: Representing the Behavior Space of an RL Agent
ICML 2025
Argus: A Compact and Versatile Foundation Model for Vision
CVPR 2025
The Essentials of AI for Life and Society: An AI Literacy Course for the University Community
AAAI 2025
Deep Reinforcement Learning for Robotics: A Survey of Real-World Successes
AAAI 2025
ComposableNav: Instruction-Following Navigation in Dynamic Environments via Composable Diffusion
CORL 2025
Reward (Mis)design for Autonomous Driving (Abstract Reprint)
AAAI 2024
LaRS: Latent Reasoning Skills for Chain-of-Thought Reasoning
EMNLP 2024
Discovering Creative Behaviors through DUPLEX: Diverse Universal Features for Policy Exploration
NIPS 2024
Disentangled Unsupervised Skill Discovery for Efficient Hierarchical Reinforcement Learning
NIPS 2024
SkiLD: Unsupervised Skill Discovery Guided by Factor Interactions
NIPS 2024
N-agent Ad Hoc Teamwork
NIPS 2024
Data-Efficient Policy Evaluation Through Behavior Policy Search
JMLR 2024
Learning to Look: Seeking Information for Decision Making via Policy Factorization
CORL 2024
Sample Efficient Myopic Exploration Through Multitask Reinforcement Learning with Diverse Tasks
ICLR 2024
Learning Optimal Advantage from Preferences and Mistaking It for Reward
AAAI 2024
Building Minimal and Reusable Causal State Abstractions for Reinforcement Learning
AAAI 2024
Minimum Coverage Sets for Training Robust Ad Hoc Teamwork Agents
AAAI 2024
Causal Policy Gradient for Whole-Body Mobile Manipulation
RSS 2023
FAMO: Fast Adaptive Multitask Optimization
NIPS 2023
LIBERO: Benchmarking Knowledge Transfer for Lifelong Robot Learning
NIPS 2023
ELDEN: Exploration via Local Dependencies
NIPS 2023
f-Policy Gradients: A General Framework for Goal-Conditioned RL using f-Divergences
NIPS 2023
MACTA: A Multi-agent Reinforcement Learning Approach for Cache Timing Attacks and Detection
ICLR 2023
The Perils of Trial-and-Error Reward Design: Misdesign through Overfitting and Invalid Task Specifications
AAAI 2023
Metric Residual Network for Sample Efficient Goal-Conditioned Reinforcement Learning
AAAI 2023
Learning Generalizable Manipulation Policies with Object-Centric 3D Representations
CORL 2023
DM²: Decentralized Multi-Agent Reinforcement Learning via Distribution Matching
AAAI 2023
STERLING: Self-Supervised Terrain Representation Learning from Unconstrained Robot Experience
CORL 2023
Motion Planning (In)feasibility Detection using a Prior Roadmap via Path and Cut Search
RSS 2023
Composing Efficient, Robust Tests for Policy Selection
UAI 2023
Dynamic Sparse Training for Deep Reinforcement Learning
IJCAI 2022
Value Function Decomposition for Iterative Design of Reinforcement Learning Agents
NIPS 2022
BOME! Bilevel Optimization Made Easy: A Simple First-Order Approach
NIPS 2022
VIOLA: Imitation Learning for Vision-Based Manipulation with Object Proposal Priors
CORL 2022
Learning to Correct Mistakes: Backjumping in Long-Horizon Task and Motion Planning
CORL 2022
Coopernaut: End-to-End Driving With Cooperative Perception for Networked Vehicles
CVPR 2022
Causal Dynamics Learning for Task-Independent State Abstraction
ICML 2022
Coach-Player Multi-agent Reinforcement Learning for Dynamic Team Composition
ICML 2021
Demonstration of the EMPATHIC Framework for Task Learning from Implicit Human Feedback
AAAI 2021
Goal Blending for Responsive Shared Autonomy in a Navigating Vehicle
AAAI 2021
Adversarial Intrinsic Motivation for Reinforcement Learning
NIPS 2021
Conflict-Averse Gradient Descent for Multi-task learning
NIPS 2021
Machine versus Human Attention in Deep Reinforcement Learning Tasks
NIPS 2021
Expected Value of Communication for Planning in Ad Hoc Teamwork
AAAI 2021
Temporal-Logic-Based Reward Shaping for Continuing Reinforcement Learning Tasks
AAAI 2021
Curriculum Learning for Reinforcement Learning Domains: A Framework and Survey
JMLR 2020
Balancing Individual Preferences and Shared Objectives in Multiagent Reinforcement Learning
IJCAI 2020
A Penny for Your Thoughts: The Value of Communication in Ad Hoc Teamwork
IJCAI 2020
Reducing Sampling Error in Batch Temporal Difference Learning
ICML 2020
Firefly Neural Architecture Descent: a General Approach for Growing Neural Networks
NIPS 2020
Learning to Improve Multi-Robot Hallway Navigation
CORL 2020
The EMPATHIC Framework for Task Learning from Implicit Human Feedback
CORL 2020
An Imitation from Observation Approach to Transfer Learning with Dynamics Mismatch
NIPS 2020
Imitation Learning from Video by Leveraging Proprioception
IJCAI 2019
Leveraging Human Guidance for Deep Reinforcement Learning Tasks
IJCAI 2019
Importance Sampling Policy Evaluation with an Estimated Behavior Policy
ICML 2019
Selecting Compliant Agents for Opt-in Micro-Tolling
AAAI 2019
Ad Hoc Teamwork With Behavior Switching Agents
IJCAI 2019
Recent Advances in Imitation Learning from Observation
IJCAI 2019
Learning a Policy for Opportunistic Active Learning
EMNLP 2018
Multi-modal Predicate Identification using Dynamically Learned Robot Controllers
IJCAI 2018
Behavioral Cloning from Observation
IJCAI 2018
Opportunistic Active Learning for Grounding Natural Language Descriptions
CORL 2017
Autonomous Task Sequencing for Customized Curriculum Design in Reinforcement Learning
IJCAI 2017
Data-Efficient Policy Evaluation Through Behavior Policy Search
ICML 2017
Learning to Order Objects Using Haptic and Proprioceptive Exploratory Behaviors
IJCAI 2016
On the Analysis of Complex Backup Strategies in Monte Carlo Tree Search
ICML 2016
Robot Scavenger Hunt: A Standardized Framework for Evaluating Intelligent Mobile Robots
IJCAI 2016
Learning Multi-Modal Grounded Linguistic Semantics by Playing “I Spy”
IJCAI 2016
When Security Games Go Green: Designing Defender Strategies to Prevent Poaching and Illegal Fishing
IJCAI 2015
Learning to Interpret Natural Language Commands through Human-Robot Dialog
IJCAI 2015
Transfer Learning for Reinforcement Learning Domains: A Survey
JMLR 2009
Transfer Learning via Inter-Task Mappings for Temporal Difference Learning
JMLR 2007
Evolutionary Function Approximation for Reinforcement Learning
JMLR 2006