Reinforcement Learning › Methods ›

Deep RL

3861 directly classified papers

Papers per year

Papers

Divergence-Regularized Multi-Agent Actor-Critic ICML 2022

Biased Gradient Estimate with Drastic Variance Reduction for Meta Reinforcement Learning ICML 2022

Towards Evaluating Adaptivity of Model-Based Reinforcement Learning Methods ICML 2022

Denoised MDPs: Learning World Models Better Than the World Itself ICML 2022

Model-based Meta Reinforcement Learning using Graph Structured Surrogate Models and Amortized Policy Search ICML 2022

Policy Gradient Method For Robust Reinforcement Learning ICML 2022

Robust Deep Reinforcement Learning through Bootstrapped Opportunistic Curriculum ICML 2022

Prompting Decision Transformer for Few-Shot Policy Generalization ICML 2022

Reachability Constrained Reinforcement Learning ICML 2022

Topology-Aware Network Pruning using Multi-stage Graph Embedding and Reinforcement Learning ICML 2022

Actor-Critic based Improper Reinforcement Learning ICML 2022

Stabilizing Q-learning with Linear Architectures for Provable Efficient Learning ICML 2022

Efficient Reinforcement Learning in Block MDPs: A Model-free Representation Learning approach ICML 2022

Dynamic Regret of Online Markov Decision Processes ICML 2022

Efficient Learning for AlphaZero via Path Consistency ICML 2022

Online Decision Transformer ICML 2022

Sequential Voting With Relational Box Fields for Active Object Detection CVPR 2022

Near-Optimal Algorithms for Autonomous Exploration and Multi-Goal Stochastic Shortest Path ICML 2022

Improved No-Regret Algorithms for Stochastic Shortest Path with Linear MDP ICML 2022

Cooperative Online Learning in Stochastic and Adversarial MDPs ICML 2022

Joint Synthesis of Safety Certificate and Safe Control Policy Using Constrained Reinforcement Learning L4DC 2022

Experience Replay with Likelihood-free Importance Weights L4DC 2022

Safe Reinforcement Learning with Chance-constrained Model Predictive Control L4DC 2022

Reinforcement Learning with Almost Sure Constraints L4DC 2022

Block Contextual MDPs for Continual Learning L4DC 2022