conftrace_

Pieter Abbeel

216 papers · 2005–2026 · 15 conferences · across top CS/AI conferences

Achievements

Jump to papers ↓

+19 more ↓

🗺️ Taxonomy Completionist (41) 🧭 Keyword Pioneer 🌈 Renaissance Researcher (7) 🌉 Interdisciplinary Bridge 🐣 Hot Topic Early Bird

🧭 Keyword Pioneer 🌈 Renaissance Researcher (7) 🗺️ Taxonomy Completionist (41) 🌟 Keyword Trendsetter Combo (26) 🏠 Conference Loyalist (23) 🏆 Keyword Champion (7) 🔬 Deep Specialist (16) 🌱 Topic Pioneer 🤝 Dynamic Duo (35) 🏆 Grand Slam 👑 Triple Crown 👥 Mega-Team (34) 📈 Trend Setter ⚡ Prolific Year (21) ❓ The Questioner 🗃️ Keyword Collector (188) 🚀 Conference Pioneer 💎 Century Club (215) 🔥 Unstoppable (14)

Conferences

NIPS (61) ICML (52) ICLR (42) CORL (23) RSS (17) AAAI (4) CVPR (4) JMLR (3) ECCV (2) IJCAI (2) UAI (2) AISTATS (1) EMNLP (1) ICCV (1) L4DC (1)

Top co-authors

Sergey Levine (36) Kimin Lee (19) Hao Liu (14) Yan Duan (11) Lerrel Pinto (11) Anca Dragan (10) Xi Chen (10) Igor Mordatch (10) Younggyo Seo (10) Stephen James (9)

Research topics

Reinforcement Learning (2) Robotics (1)

Keywords

reinforcement learning (49) imitation learning (16) representation learning (14) model-based reinforcement learning (11) robot manipulation (10) deep reinforcement learning (10) off-policy learning (9) transfer learning (8) autoregressive model (8) policy learning (7) continuous control (7) robotic manipulation (7) world model (6) policy gradient (6) contrastive learning (6) self-supervised learning (6) offline reinforcement learning (5) density estimation (5) variational inference (5) few-shot learning (5)

Papers

Cliqueformer: Model-Based Optimization with Structured Transformers AAAI 2026 DexterityGen: Foundation Controller for Unprecedented Dexterity RSS 2025 RoboVerse: A Unified Platform, Benchmark and Dataset for Scalable and Generalizable Robot Learning RSS 2025 Demonstrating MuJoCo Playground RSS 2025 Value-Based Deep RL Scales Predictably ICML 2025 Chip Placement with Diffusion Models ICML 2025 OTTER: A Vision-Language-Action Model with Text-Aware Visual Feature Extraction ICML 2025 Prioritized Generative Replay ICLR 2025 One Step Diffusion via Shortcut Models ICLR 2025 Protein Language Model Fitness is a Matter of Preference ICLR 2025 ElasticTok: Adaptive Tokenization for Image and Video ICLR 2025 World Model on Million-Length Video And Language With Blockwise RingAttention ICLR 2025 SEMDICE: Off-policy State Entropy Maximization via Stationary Distribution Correction Estimation ICLR 2025 MaxInfoRL: Boosting exploration in reinforcement learning through information gain maximization ICLR 2025 Efficient Long Video Tokenization via Coordinate-based Patch Reconstruction CVPR 2025 The Sound of Simulation: Learning Multimodal Sim-to-Real Robot Policies with Generative Audio CORL 2025 Visual Imitation Enables Contextual Humanoid Control CORL 2025 Learning Robotic Locomotion Affordances and Photorealistic Simulators from Human-Captured Data CORL 2024 Vision Foundation Model Enables Generalizable Object Pose Estimation NIPS 2024 A StrongREJECT for Empty Jailbreaks NIPS 2024 Reinforcement Learning with Foundation Priors: Let Embodied Agent Efficiently Learn on Its Own CORL 2024 Body Transformer: Leveraging Robot Embodiment for Policy Learning CORL 2024 Twisting Lids Off with Two Hands CORL 2024 Functional Graphical Models: Structure Enables Offline Data-Driven Optimization AISTATS 2024 RingAttention with Blockwise Transformers for Near-Infinite Context ICLR 2024 Chain of Hindsight aligns Language Models with Feedback ICLR 2024 The False Promise of Imitating Proprietary Language Models ICLR 2024 Video Language Planning ICLR 2024 Probabilistic Adaptation of Black-Box Text-to-Video Models ICLR 2024 Scalable Diffusion for Materials Generation ICLR 2024 Learning Interactive Real-World Simulators ICLR 2024 DreamSmooth: Improving Model-based Reinforcement Learning via Reward Smoothing ICLR 2024 Tensor Trust: Interpretable Prompt Injection Attacks from an Online Game ICLR 2024 Unsupervised Zero-Shot Reinforcement Learning via Functional Reward Encodings ICML 2024 Visual Representation Learning with Stochastic Frame Prediction ICML 2024 Learning to Model the World With Language ICML 2024 Learning a Diffusion Model Policy from Rewards via Q-Score Matching ICML 2024 Position: Video as the New Language for Real-World Decision Making ICML 2024 Offline Imitation Learning Through Graph Search and Retrieval RSS 2024 HumanoidBench: Simulated Humanoid Benchmark for Whole-Body Locomotion and Manipulation RSS 2024 MOKA: Open-World Robotic Manipulation through Mark-Based Visual Prompting RSS 2024 Any-point Trajectory Modeling for Policy Learning RSS 2024 Guiding Pretraining in Reinforcement Learning with Large Language Models ICML 2023 CLUTR: Curriculum Learning via Unsupervised Task Representation Learning ICML 2023 Language Quantized AutoEncoders: Towards Unsupervised Text-Image Alignment NIPS 2023 Blockwise Parallel Transformers for Large Context Models NIPS 2023 Learning Universal Policies via Text-Guided Video Generation NIPS 2023 Accelerating Reinforcement Learning with Value-Conditional State Entropy Exploration NIPS 2023 Video Prediction Models as Rewards for Reinforcement Learning NIPS 2023 AlberDICE: Addressing Out-Of-Distribution Joint Actions in Offline Multi-Agent RL via Alternating Stationary Distribution Correction Estimation NIPS 2023 DPOK: Reinforcement Learning for Fine-tuning Text-to-Image Diffusion Models NIPS 2023 RoboPianist: Dexterous Piano Playing with Deep Reinforcement Learning CORL 2023 Language-Conditioned Path Planning CORL 2023 Where are we in the search for an Artificial Visual Cortex for Embodied Intelligence? NIPS 2023 The Wisdom of Hindsight Makes Language Models Better Instruction Followers ICML 2023 Temporally Consistent Transformers for Video Generation ICML 2023 Robust and Versatile Bipedal Jumping Control through Reinforcement Learning RSS 2023 Dichotomy of Control: Separating What You Can Control from What You Cannot ICLR 2023 Become a Proficient Player with Limited Data through Watching Pure Videos ICLR 2023 Masked Trajectory Models for Prediction, Representation, and Control ICML 2023 Multi-Environment Pretraining Enables Transfer to Action Limited Datasets ICML 2023 Improving Long-Horizon Imitation through Instruction Prediction AAAI 2023 Preference Transformer: Modeling Human Preferences using Transformers for RL ICLR 2023 Multi-View Masked World Models for Visual Robotic Manipulation ICML 2023 Controllability-Aware Unsupervised Skill Discovery ICML 2023 VectorFusion: Text-to-SVG by Abstracting Pixel-Based Diffusion Models CVPR 2023 Emergent Agentic Transformer from Chain of Hindsight Experience ICML 2023 Programmatic Modeling and Generation of Real-Time Strategic Soccer Environments for Reinforcement Learning AAAI 2022 Autoregressive Uncertainty Modeling for 3D Bounding Box Prediction ECCV 2022 Sim-to-Real 6D Object Pose Estimation via Iterative Self-Training for Robotic Bin Picking ECCV 2022 On the Effectiveness of Fine-tuning Versus Meta-reinforcement Learning NIPS 2022 Unsupervised Reinforcement Learning with Contrastive Intrinsic Control NIPS 2022 SURF: Semi-supervised Reward Learning with Data Augmentation for Feedback-efficient Preference-based Reinforcement Learning ICLR 2022 Reward Uncertainty for Exploration in Preference-based Reinforcement Learning ICLR 2022 It Takes Four to Tango: Multiagent Self Play for Automatic Curriculum Generation ICLR 2022 Chain of Thought Imitation with Procedure Cloning NIPS 2022 AdaCat: Adaptive categorical discretization for autoregressive models UAI 2022 Spending Thinking Time Wisely: Accelerating MCTS with Virtual Expansions NIPS 2022 Masked Autoencoding for Scalable and Generalizable Decision Making NIPS 2022 Fleet-DAgger: Interactive Robot Fleet Learning with Scalable Human Supervision CORL 2022 Real-World Robot Learning with Masked Visual Pre-training CORL 2022 Masked World Models for Visual Control CORL 2022 Sim-to-Real via Sim-to-Seg: End-to-end Off-road Autonomous Driving Without Real Data CORL 2022 DayDreamer: World Models for Physical Robot Learning CORL 2022 Reinforcement Learning with Action-Free Pre-Training from Videos ICML 2022 Reducing Variance in Temporal-Difference Value Estimation via Ensemble of Deep Networks ICML 2022 Language Models as Zero-Shot Planners: Extracting Actionable Knowledge for Embodied Agents ICML 2022 Hierarchical Few-Shot Imitation with Skill Transition Models ICLR 2022 Frozen Pretrained Transformers as Universal Computation Engines AAAI 2022 Deep Hierarchical Planning from Pixels NIPS 2022 Zero-Shot Text-Guided Object Generation With Dream Fields CVPR 2022 Reinforcement Learning with Latent Flow NIPS 2021 Mastering Atari Games with Limited Data NIPS 2021 Improving Computational Efficiency in Visual Reinforcement Learning via Stored Embeddings NIPS 2021 Skill Preferences: Learning to Extract and Execute Robotic Skills from Human Feedback CORL 2021 Offline-to-Online Reinforcement Learning via Balanced Replay and Pessimistic Q-Ensemble CORL 2021 Bottleneck Transformers for Visual Recognition CVPR 2021 Contrastive Code Representation Learning EMNLP 2021 Putting NeRF on a Diet: Semantically Consistent Few-Shot View Synthesis ICCV 2021 Reset-Free Lifelong Learning with Skill-Space Planning ICLR 2021 Efficient Empowerment Estimation for Unsupervised Stabilization ICLR 2021 Learning What To Do by Simulating the Past ICLR 2021 Task-Agnostic Morphology Evolution ICLR 2021 Self-Supervised Policy Adaptation during Deployment ICLR 2021 Mutual Information State Intrinsic Control ICLR 2021 Unsupervised Learning of Visual 3D Keypoints for Control ICML 2021 SUNRISE: A Simple Unified Framework for Ensemble Learning in Deep Reinforcement Learning ICML 2021 PEBBLE: Feedback-Efficient Interactive Reinforcement Learning via Relabeling Experience and Unsupervised Pre-training ICML 2021 APS: Active Pretraining with Successor Features ICML 2021 MSA Transformer ICML 2021 State Entropy Maximization with Random Encoders for Efficient Exploration ICML 2021 Decoupling Representation Learning from Reinforcement Learning ICML 2021 Hindsight Task Relabelling: Experience Replay for Sparse Reward Meta-RL NIPS 2021 Teachable Reinforcement Learning via Advice Distillation NIPS 2021 Decision Transformer: Reinforcement Learning via Sequence Modeling NIPS 2021 Behavior From the Void: Unsupervised Active Pre-Training NIPS 2021 Generalized Hindsight for Reinforcement Learning NIPS 2020 Trajectory-wise Multiple Choice Learning for Dynamics Generalization in Reinforcement Learning NIPS 2020 Reinforcement Learning with Augmented Data NIPS 2020 Locally Masked Convolution for Autoregressive Models UAI 2020 AVID: Learning Multi-Stage Tasks via Pixel-Level Translation of Human Videos RSS 2020 Learning Predictive Representations for Deformable Objects Using Contrastive Estimation CORL 2020 Visual Imitation Made Easy CORL 2020 Plan2Vec: Unsupervised Representation Learning by Latent Plans L4DC 2020 Stochastic Latent Actor-Critic: Deep Reinforcement Learning with a Latent Variable Model NIPS 2020 Learning to Manipulate Deformable Objects without Demonstrations RSS 2020 Hallucinative Topological Memory for Zero-Shot Visual Planning ICML 2020 Model-Augmented Actor-Critic: Backpropagating through Paths ICLR 2020 Sub-policy Adaptation for Hierarchical Reinforcement Learning ICLR 2020 Planning to Explore via Self-Supervised World Models ICML 2020 Responsive Safety in Reinforcement Learning by PID Lagrangian Methods ICML 2020 Variable Skipping for Autoregressive Range Density Estimation ICML 2020 AvE: Assistance via Empowerment NIPS 2020 Sparse Graphical Memory for Robust Planning NIPS 2020 Denoising Diffusion Probabilistic Models NIPS 2020 Hierarchically Decoupled Imitation For Morphological Transfer ICML 2020 CURL: Contrastive Unsupervised Representations for Reinforcement Learning ICML 2020 Automatic Curriculum Learning through Value Disagreement NIPS 2020 Population Based Augmentation: Efficient Learning of Augmentation Policy Schedules ICML 2019 Geometry-Aware Neural Rendering NIPS 2019 Learning to Adapt in Dynamic, Real-World Environments through Meta-Reinforcement Learning ICLR 2019 ProMP: Proximal Meta-Policy Search ICLR 2019 Variational Discriminator Bottleneck: Improving Imitation Learning, Inverse RL, and GANs by Constraining Information Flow ICLR 2019 Preferences Implicit in the State of the World ICLR 2019 Guiding Policies with Language via Meta-Learning ICLR 2019 MCP: Learning Composable Hierarchical Control with Multiplicative Compositional Policies NIPS 2019 Addressing Sample Complexity in Visual Tasks Using HER and Hallucinatory GANs NIPS 2019 Evaluating Protein Transfer Learning with TAPE NIPS 2019 Compositional Plan Vectors NIPS 2019 Asynchronous Methods for Model-Based Reinforcement Learning CORL 2019 Guided Meta-Policy Search NIPS 2019 Goal-conditioned Imitation Learning NIPS 2019 Compression with Flows via Local Bits-Back Coding NIPS 2019 On the Utility of Learning about Humans for Human-AI Coordination NIPS 2019 Learning Robotic Manipulation through Visual Planning and Acting RSS 2019 Flow++: Improving Flow-Based Generative Models with Variational Dequantization and Architecture Design ICML 2019 Bit-Swap: Recursive Bits-Back Coding for Lossless Compression with Hierarchical Latent Variables ICML 2019 On the Feasibility of Learning, Rather than Assuming, Human Biases for Reward Inference ICML 2019 SOLAR: Deep Structured Representations for Model-Based Reinforcement Learning ICML 2019 Continuous Adaptation via Meta-Learning in Nonstationary and Competitive Environments ICLR 2018 Parameter Space Noise for Exploration ICLR 2018 Learning Plannable Representations with Causal InfoGAN NIPS 2018 One-Shot Imitation from Observing Humans via Domain-Adaptive Meta-Learning RSS 2018 Asymmetric Actor Critic for Image-Based Robot Learning RSS 2018 PixelSNAIL: An Improved Autoregressive Generative Model ICML 2018 Self-Consistent Trajectory Autoencoder: Hierarchical Reinforcement Learning with Trajectory Embeddings ICML 2018 Automatic Goal Generation for Reinforcement Learning Agents ICML 2018 Latent Space Policies for Hierarchical Reinforcement Learning ICML 2018 Soft Actor-Critic: Off-Policy Maximum Entropy Deep Reinforcement Learning with a Stochastic Actor ICML 2018 Universal Planning Networks: Learning Generalizable Representations for Visuomotor Control ICML 2018 Composable Action-Conditioned Predictors: Flexible Off-Policy Learning for Robot Navigation CORL 2018 Model-Based Reinforcement Learning via Meta-Policy Optimization CORL 2018 Meta-Reinforcement Learning of Structured Exploration Strategies NIPS 2018 Evolved Policy Gradients NIPS 2018 The Importance of Sampling inMeta-Reinforcement Learning NIPS 2018 Model-Ensemble Trust-Region Policy Optimization ICLR 2018 Variance Reduction for Policy Gradient with Action-Dependent Factorized Baselines ICLR 2018 A Simple Neural Attentive Meta-Learner ICLR 2018 META LEARNING SHARED HIERARCHIES ICLR 2018 One-Shot Visual Imitation Learning via Meta-Learning CORL 2017 Mutual Alignment Transfer Learning CORL 2017 The Off-Switch Game IJCAI 2017 Value Iteration Networks IJCAI 2017 Enabling Robots to Communicate Their Objectives RSS 2017 One-Shot Imitation Learning NIPS 2017 #Exploration: A Study of Count-Based Exploration for Deep Reinforcement Learning NIPS 2017 Prediction and Control with Temporal Segment Models ICML 2017 Reinforcement Learning with Deep Energy-Based Policies ICML 2017 Model-Agnostic Meta-Learning for Fast Adaptation of Deep Networks ICML 2017 Constrained Policy Optimization ICML 2017 Inverse Reward Design NIPS 2017 Reverse Curriculum Generation for Reinforcement Learning CORL 2017 Cooperative Inverse Reinforcement Learning NIPS 2016 Guided Cost Learning: Deep Inverse Optimal Control via Policy Optimization ICML 2016 Backprop KF: Learning Discriminative Deterministic State Estimators NIPS 2016 End-to-End Training of Deep Visuomotor Policies JMLR 2016 InfoGAN: Interpretable Representation Learning by Information Maximizing Generative Adversarial Nets NIPS 2016 VIME: Variational Information Maximizing Exploration NIPS 2016 Learning to Poke by Poking: Experiential Learning of Intuitive Physics NIPS 2016 Benchmarking Deep Reinforcement Learning for Continuous Control ICML 2016 Combinatorial Energy Learning for Image Segmentation NIPS 2016 Value Iteration Networks NIPS 2016 Alpha-Beta Divergences Discover Micro and Macro Structures in Data ICML 2015 Information-Theoretic Planning with Trajectory Optimization for Dense 3D Mapping RSS 2015 Gradient Estimation Using Stochastic Computation Graphs NIPS 2015 Trust Region Policy Optimization ICML 2015 Learning Neural Network Policies with Guided Policy Search under Unknown Dynamics NIPS 2014 Finding Locally Optimal, Collision-Free Trajectories with Sequential Convex Optimization RSS 2013 Risk Aversion in Markov Decision Processes via Near Optimal Chernoff Bounds NIPS 2012 On a Connection between Importance Sampling and the Likelihood Ratio Policy Gradient NIPS 2010 Max-margin Classification of Data with Absent Features JMLR 2008 Hierarchical Apprenticeship Learning with Application to Quadruped Locomotion NIPS 2007 Learning Factor Graphs in Polynomial Time and Sample Complexity JMLR 2006 Max-margin classification of incomplete data NIPS 2006 An Application of Reinforcement Learning to Aerobatic Helicopter Flight NIPS 2006 Discriminative Training of Kalman Filters RSS 2005