Pieter Abbeel
216 papers · 2005–2026 · 15 top CS/AI conferences
Achievements
Taxonomy Completionist
(41)
Keyword Pioneer
Renaissance Researcher
(7)
Interdisciplinary Bridge
Hot Topic Early Bird
Keyword Trendsetter Combo
(26)
Conference Loyalist
(23)
Keyword Champion
(7)
Deep Specialist
(16)
Topic Pioneer
Dynamic Duo
(35)
Grand Slam
Triple Crown
Mega-Team
(34)
Trend Setter
Prolific Year
(21)
The Questioner
Keyword Collector
(188)
Conference Pioneer
Century Club
(215)
Unstoppable
(14)
Conferences
NIPS (61)
ICML (52)
ICLR (42)
CORL (23)
RSS (17)
AAAI (4)
CVPR (4)
JMLR (3)
ECCV (2)
IJCAI (2)
UAI (2)
AISTATS (1)
EMNLP (1)
ICCV (1)
L4DC (1)
Research topics
Keywords
reinforcement learning
(49)
imitation learning
(16)
representation learning
(14)
model-based reinforcement learning
(11)
robot manipulation
(10)
deep reinforcement learning
(10)
off-policy learning
(9)
transfer learning
(8)
autoregressive model
(8)
policy learning
(7)
continuous control
(7)
robotic manipulation
(7)
world model
(6)
policy gradient
(6)
contrastive learning
(6)
self-supervised learning
(6)
offline reinforcement learning
(5)
density estimation
(5)
variational inference
(5)
few-shot learning
(5)
Papers
Cliqueformer: Model-Based Optimization with Structured Transformers
AAAI 2026
DexterityGen: Foundation Controller for Unprecedented Dexterity
RSS 2025
RoboVerse: A Unified Platform, Benchmark and Dataset for Scalable and Generalizable Robot Learning
RSS 2025
Demonstrating MuJoCo Playground
RSS 2025
Value-Based Deep RL Scales Predictably
ICML 2025
Chip Placement with Diffusion Models
ICML 2025
OTTER: A Vision-Language-Action Model with Text-Aware Visual Feature Extraction
ICML 2025
Prioritized Generative Replay
ICLR 2025
One Step Diffusion via Shortcut Models
ICLR 2025
Protein Language Model Fitness is a Matter of Preference
ICLR 2025
ElasticTok: Adaptive Tokenization for Image and Video
ICLR 2025
World Model on Million-Length Video And Language With Blockwise RingAttention
ICLR 2025
SEMDICE: Off-policy State Entropy Maximization via Stationary Distribution Correction Estimation
ICLR 2025
MaxInfoRL: Boosting exploration in reinforcement learning through information gain maximization
ICLR 2025
Efficient Long Video Tokenization via Coordinate-based Patch Reconstruction
CVPR 2025
The Sound of Simulation: Learning Multimodal Sim-to-Real Robot Policies with Generative Audio
CORL 2025
Visual Imitation Enables Contextual Humanoid Control
CORL 2025
Learning Robotic Locomotion Affordances and Photorealistic Simulators from Human-Captured Data
CORL 2024
Vision Foundation Model Enables Generalizable Object Pose Estimation
NIPS 2024
A StrongREJECT for Empty Jailbreaks
NIPS 2024
Reinforcement Learning with Foundation Priors: Let Embodied Agent Efficiently Learn on Its Own
CORL 2024
Body Transformer: Leveraging Robot Embodiment for Policy Learning
CORL 2024
Twisting Lids Off with Two Hands
CORL 2024
Functional Graphical Models: Structure Enables Offline Data-Driven Optimization
AISTATS 2024
RingAttention with Blockwise Transformers for Near-Infinite Context
ICLR 2024
Chain of Hindsight aligns Language Models with Feedback
ICLR 2024
The False Promise of Imitating Proprietary Language Models
ICLR 2024
Video Language Planning
ICLR 2024
Probabilistic Adaptation of Black-Box Text-to-Video Models
ICLR 2024
Scalable Diffusion for Materials Generation
ICLR 2024
Learning Interactive Real-World Simulators
ICLR 2024
DreamSmooth: Improving Model-based Reinforcement Learning via Reward Smoothing
ICLR 2024
Tensor Trust: Interpretable Prompt Injection Attacks from an Online Game
ICLR 2024
Unsupervised Zero-Shot Reinforcement Learning via Functional Reward Encodings
ICML 2024
Visual Representation Learning with Stochastic Frame Prediction
ICML 2024
Learning to Model the World With Language
ICML 2024
Learning a Diffusion Model Policy from Rewards via Q-Score Matching
ICML 2024
Position: Video as the New Language for Real-World Decision Making
ICML 2024
Offline Imitation Learning Through Graph Search and Retrieval
RSS 2024
HumanoidBench: Simulated Humanoid Benchmark for Whole-Body Locomotion and Manipulation
RSS 2024
MOKA: Open-World Robotic Manipulation through Mark-Based Visual Prompting
RSS 2024
Any-point Trajectory Modeling for Policy Learning
RSS 2024
Guiding Pretraining in Reinforcement Learning with Large Language Models
ICML 2023
CLUTR: Curriculum Learning via Unsupervised Task Representation Learning
ICML 2023
Language Quantized AutoEncoders: Towards Unsupervised Text-Image Alignment
NIPS 2023
Blockwise Parallel Transformers for Large Context Models
NIPS 2023
Learning Universal Policies via Text-Guided Video Generation
NIPS 2023
Accelerating Reinforcement Learning with Value-Conditional State Entropy Exploration
NIPS 2023
Video Prediction Models as Rewards for Reinforcement Learning
NIPS 2023
AlberDICE: Addressing Out-Of-Distribution Joint Actions in Offline Multi-Agent RL via Alternating Stationary Distribution Correction Estimation
NIPS 2023
DPOK: Reinforcement Learning for Fine-tuning Text-to-Image Diffusion Models
NIPS 2023
RoboPianist: Dexterous Piano Playing with Deep Reinforcement Learning
CORL 2023
Language-Conditioned Path Planning
CORL 2023
Where are we in the search for an Artificial Visual Cortex for Embodied Intelligence?
NIPS 2023
The Wisdom of Hindsight Makes Language Models Better Instruction Followers
ICML 2023
Temporally Consistent Transformers for Video Generation
ICML 2023
Robust and Versatile Bipedal Jumping Control through Reinforcement Learning
RSS 2023
Dichotomy of Control: Separating What You Can Control from What You Cannot
ICLR 2023
Become a Proficient Player with Limited Data through Watching Pure Videos
ICLR 2023
Masked Trajectory Models for Prediction, Representation, and Control
ICML 2023
Multi-Environment Pretraining Enables Transfer to Action Limited Datasets
ICML 2023
Improving Long-Horizon Imitation through Instruction Prediction
AAAI 2023
Preference Transformer: Modeling Human Preferences using Transformers for RL
ICLR 2023
Multi-View Masked World Models for Visual Robotic Manipulation
ICML 2023
Controllability-Aware Unsupervised Skill Discovery
ICML 2023
VectorFusion: Text-to-SVG by Abstracting Pixel-Based Diffusion Models
CVPR 2023
Emergent Agentic Transformer from Chain of Hindsight Experience
ICML 2023
Programmatic Modeling and Generation of Real-Time Strategic Soccer Environments for Reinforcement Learning
AAAI 2022
Autoregressive Uncertainty Modeling for 3D Bounding Box Prediction
ECCV 2022
Sim-to-Real 6D Object Pose Estimation via Iterative Self-Training for Robotic Bin Picking
ECCV 2022
On the Effectiveness of Fine-tuning Versus Meta-reinforcement Learning
NIPS 2022
Unsupervised Reinforcement Learning with Contrastive Intrinsic Control
NIPS 2022
SURF: Semi-supervised Reward Learning with Data Augmentation for Feedback-efficient Preference-based Reinforcement Learning
ICLR 2022
Reward Uncertainty for Exploration in Preference-based Reinforcement Learning
ICLR 2022
It Takes Four to Tango: Multiagent Self Play for Automatic Curriculum Generation
ICLR 2022
Chain of Thought Imitation with Procedure Cloning
NIPS 2022
AdaCat: Adaptive categorical discretization for autoregressive models
UAI 2022
Spending Thinking Time Wisely: Accelerating MCTS with Virtual Expansions
NIPS 2022
Masked Autoencoding for Scalable and Generalizable Decision Making
NIPS 2022
Fleet-DAgger: Interactive Robot Fleet Learning with Scalable Human Supervision
CORL 2022
Real-World Robot Learning with Masked Visual Pre-training
CORL 2022
Masked World Models for Visual Control
CORL 2022
Sim-to-Real via Sim-to-Seg: End-to-end Off-road Autonomous Driving Without Real Data
CORL 2022
DayDreamer: World Models for Physical Robot Learning
CORL 2022
Reinforcement Learning with Action-Free Pre-Training from Videos
ICML 2022
Reducing Variance in Temporal-Difference Value Estimation via Ensemble of Deep Networks
ICML 2022
Language Models as Zero-Shot Planners: Extracting Actionable Knowledge for Embodied Agents
ICML 2022
Hierarchical Few-Shot Imitation with Skill Transition Models
ICLR 2022
Frozen Pretrained Transformers as Universal Computation Engines
AAAI 2022
Deep Hierarchical Planning from Pixels
NIPS 2022
Zero-Shot Text-Guided Object Generation With Dream Fields
CVPR 2022
Reinforcement Learning with Latent Flow
NIPS 2021
Mastering Atari Games with Limited Data
NIPS 2021
Improving Computational Efficiency in Visual Reinforcement Learning via Stored Embeddings
NIPS 2021
Skill Preferences: Learning to Extract and Execute Robotic Skills from Human Feedback
CORL 2021
Offline-to-Online Reinforcement Learning via Balanced Replay and Pessimistic Q-Ensemble
CORL 2021
Bottleneck Transformers for Visual Recognition
CVPR 2021
Contrastive Code Representation Learning
EMNLP 2021
Putting NeRF on a Diet: Semantically Consistent Few-Shot View Synthesis
ICCV 2021
Reset-Free Lifelong Learning with Skill-Space Planning
ICLR 2021
Efficient Empowerment Estimation for Unsupervised Stabilization
ICLR 2021
Learning What To Do by Simulating the Past
ICLR 2021
Task-Agnostic Morphology Evolution
ICLR 2021
Self-Supervised Policy Adaptation during Deployment
ICLR 2021
Mutual Information State Intrinsic Control
ICLR 2021
Unsupervised Learning of Visual 3D Keypoints for Control
ICML 2021
SUNRISE: A Simple Unified Framework for Ensemble Learning in Deep Reinforcement Learning
ICML 2021
PEBBLE: Feedback-Efficient Interactive Reinforcement Learning via Relabeling Experience and Unsupervised Pre-training
ICML 2021
APS: Active Pretraining with Successor Features
ICML 2021
MSA Transformer
ICML 2021
State Entropy Maximization with Random Encoders for Efficient Exploration
ICML 2021
Decoupling Representation Learning from Reinforcement Learning
ICML 2021
Hindsight Task Relabelling: Experience Replay for Sparse Reward Meta-RL
NIPS 2021
Teachable Reinforcement Learning via Advice Distillation
NIPS 2021
Decision Transformer: Reinforcement Learning via Sequence Modeling
NIPS 2021
Behavior From the Void: Unsupervised Active Pre-Training
NIPS 2021
Generalized Hindsight for Reinforcement Learning
NIPS 2020
Trajectory-wise Multiple Choice Learning for Dynamics Generalization in Reinforcement Learning
NIPS 2020
Reinforcement Learning with Augmented Data
NIPS 2020
Locally Masked Convolution for Autoregressive Models
UAI 2020
AVID: Learning Multi-Stage Tasks via Pixel-Level Translation of Human Videos
RSS 2020
Learning Predictive Representations for Deformable Objects Using Contrastive Estimation
CORL 2020
Visual Imitation Made Easy
CORL 2020
Plan2Vec: Unsupervised Representation Learning by Latent Plans
L4DC 2020
Stochastic Latent Actor-Critic: Deep Reinforcement Learning with a Latent Variable Model
NIPS 2020
Learning to Manipulate Deformable Objects without Demonstrations
RSS 2020
Hallucinative Topological Memory for Zero-Shot Visual Planning
ICML 2020
Model-Augmented Actor-Critic: Backpropagating through Paths
ICLR 2020
Sub-policy Adaptation for Hierarchical Reinforcement Learning
ICLR 2020
Planning to Explore via Self-Supervised World Models
ICML 2020
Responsive Safety in Reinforcement Learning by PID Lagrangian Methods
ICML 2020
Variable Skipping for Autoregressive Range Density Estimation
ICML 2020
AvE: Assistance via Empowerment
NIPS 2020
Sparse Graphical Memory for Robust Planning
NIPS 2020
Denoising Diffusion Probabilistic Models
NIPS 2020
Hierarchically Decoupled Imitation For Morphological Transfer
ICML 2020
CURL: Contrastive Unsupervised Representations for Reinforcement Learning
ICML 2020
Automatic Curriculum Learning through Value Disagreement
NIPS 2020
Population Based Augmentation: Efficient Learning of Augmentation Policy Schedules
ICML 2019
Geometry-Aware Neural Rendering
NIPS 2019
Learning to Adapt in Dynamic, Real-World Environments through Meta-Reinforcement Learning
ICLR 2019
ProMP: Proximal Meta-Policy Search
ICLR 2019
Variational Discriminator Bottleneck: Improving Imitation Learning, Inverse RL, and GANs by Constraining Information Flow
ICLR 2019
Preferences Implicit in the State of the World
ICLR 2019
Guiding Policies with Language via Meta-Learning
ICLR 2019
MCP: Learning Composable Hierarchical Control with Multiplicative Compositional Policies
NIPS 2019
Addressing Sample Complexity in Visual Tasks Using HER and Hallucinatory GANs
NIPS 2019
Evaluating Protein Transfer Learning with TAPE
NIPS 2019
Compositional Plan Vectors
NIPS 2019
Asynchronous Methods for Model-Based Reinforcement Learning
CORL 2019
Guided Meta-Policy Search
NIPS 2019
Goal-conditioned Imitation Learning
NIPS 2019
Compression with Flows via Local Bits-Back Coding
NIPS 2019
On the Utility of Learning about Humans for Human-AI Coordination
NIPS 2019
Learning Robotic Manipulation through Visual Planning and Acting
RSS 2019
Flow++: Improving Flow-Based Generative Models with Variational Dequantization and Architecture Design
ICML 2019
Bit-Swap: Recursive Bits-Back Coding for Lossless Compression with Hierarchical Latent Variables
ICML 2019
On the Feasibility of Learning, Rather than Assuming, Human Biases for Reward Inference
ICML 2019
SOLAR: Deep Structured Representations for Model-Based Reinforcement Learning
ICML 2019
Continuous Adaptation via Meta-Learning in Nonstationary and Competitive Environments
ICLR 2018
Parameter Space Noise for Exploration
ICLR 2018
Learning Plannable Representations with Causal InfoGAN
NIPS 2018
One-Shot Imitation from Observing Humans via Domain-Adaptive Meta-Learning
RSS 2018
Asymmetric Actor Critic for Image-Based Robot Learning
RSS 2018
PixelSNAIL: An Improved Autoregressive Generative Model
ICML 2018
Self-Consistent Trajectory Autoencoder: Hierarchical Reinforcement Learning with Trajectory Embeddings
ICML 2018
Automatic Goal Generation for Reinforcement Learning Agents
ICML 2018
Latent Space Policies for Hierarchical Reinforcement Learning
ICML 2018
Soft Actor-Critic: Off-Policy Maximum Entropy Deep Reinforcement Learning with a Stochastic Actor
ICML 2018
Universal Planning Networks: Learning Generalizable Representations for Visuomotor Control
ICML 2018
Composable Action-Conditioned Predictors: Flexible Off-Policy Learning for Robot Navigation
CORL 2018
Model-Based Reinforcement Learning via Meta-Policy Optimization
CORL 2018
Meta-Reinforcement Learning of Structured Exploration Strategies
NIPS 2018
Evolved Policy Gradients
NIPS 2018
The Importance of Sampling in Meta-Reinforcement Learning
NIPS 2018
Model-Ensemble Trust-Region Policy Optimization
ICLR 2018
Variance Reduction for Policy Gradient with Action-Dependent Factorized Baselines
ICLR 2018
A Simple Neural Attentive Meta-Learner
ICLR 2018
Meta Learning Shared Hierarchies
ICLR 2018
One-Shot Visual Imitation Learning via Meta-Learning
CORL 2017
Mutual Alignment Transfer Learning
CORL 2017
The Off-Switch Game
IJCAI 2017
Value Iteration Networks
IJCAI 2017
Enabling Robots to Communicate Their Objectives
RSS 2017
One-Shot Imitation Learning
NIPS 2017
#Exploration: A Study of Count-Based Exploration for Deep Reinforcement Learning
NIPS 2017
Prediction and Control with Temporal Segment Models
ICML 2017
Reinforcement Learning with Deep Energy-Based Policies
ICML 2017
Model-Agnostic Meta-Learning for Fast Adaptation of Deep Networks
ICML 2017
Constrained Policy Optimization
ICML 2017
Inverse Reward Design
NIPS 2017
Reverse Curriculum Generation for Reinforcement Learning
CORL 2017
Cooperative Inverse Reinforcement Learning
NIPS 2016
Guided Cost Learning: Deep Inverse Optimal Control via Policy Optimization
ICML 2016
Backprop KF: Learning Discriminative Deterministic State Estimators
NIPS 2016
End-to-End Training of Deep Visuomotor Policies
JMLR 2016
InfoGAN: Interpretable Representation Learning by Information Maximizing Generative Adversarial Nets
NIPS 2016
VIME: Variational Information Maximizing Exploration
NIPS 2016
Learning to Poke by Poking: Experiential Learning of Intuitive Physics
NIPS 2016
Benchmarking Deep Reinforcement Learning for Continuous Control
ICML 2016
Combinatorial Energy Learning for Image Segmentation
NIPS 2016
Value Iteration Networks
NIPS 2016
Alpha-Beta Divergences Discover Micro and Macro Structures in Data
ICML 2015
Information-Theoretic Planning with Trajectory Optimization for Dense 3D Mapping
RSS 2015
Gradient Estimation Using Stochastic Computation Graphs
NIPS 2015
Trust Region Policy Optimization
ICML 2015
Learning Neural Network Policies with Guided Policy Search under Unknown Dynamics
NIPS 2014
Finding Locally Optimal, Collision-Free Trajectories with Sequential Convex Optimization
RSS 2013
Risk Aversion in Markov Decision Processes via Near Optimal Chernoff Bounds
NIPS 2012
On a Connection between Importance Sampling and the Likelihood Ratio Policy Gradient
NIPS 2010
Max-margin Classification of Data with Absent Features
JMLR 2008
Hierarchical Apprenticeship Learning with Application to Quadruped Locomotion
NIPS 2007
Learning Factor Graphs in Polynomial Time and Sample Complexity
JMLR 2006
Max-margin classification of incomplete data
NIPS 2006
An Application of Reinforcement Learning to Aerobatic Helicopter Flight
NIPS 2006
Discriminative Training of Kalman Filters
RSS 2005