Research Explorer
Deep RL
3861 directly classified papers
Papers per year
2005: 1
2006: 9
2007: 14
2008: 15
2009: 9
2010: 21
2011: 27
2012: 32
2013: 21
2014: 17
2015: 10
2016: 33
2017: 102
2018: 222
2019: 399
2020: 450
2021: 533
2022: 478
2023: 532
2024: 513
2025: 326
2026: 97
Papers
Off-Policy Average Reward Actor-Critic with Deterministic Policy Search
ICML 2023
Posterior Sampling for Deep Reinforcement Learning
ICML 2023
Mastering the Unsupervised Reinforcement Learning Benchmark from Pixels
ICML 2023
Truncating Trajectories in Monte Carlo Reinforcement Learning
ICML 2023
Towards Learning to Imitate from a Single Video Demonstration
JMLR 2023
Learning Good State and Action Representations for Markov Decision Process via Tensor Decomposition
JMLR 2023
F2A2: Flexible Fully-decentralized Approximate Actor-critic for Cooperative Multi-agent Reinforcement Learning
JMLR 2023
A Unified Framework for Factorizing Distributional Value Functions for Multi-Agent Reinforcement Learning
JMLR 2023
Single Timescale Actor-Critic Method to Solve the Linear Quadratic Regulator with Convergence Guarantees
JMLR 2023
Neural Q-learning for solving PDEs
JMLR 2023
Model-based Reinforcement Learning with Scalable Composite Policy Gradient Estimators
ICML 2023
Predictable MDP Abstraction for Unsupervised Model-Based RL
ICML 2023
On Many-Actions Policy Gradient
ICML 2023
Self-Supervised Behavior Cloned Transformers are Path Crawlers for Text Games
EMNLP 2023
3D-Aware Object Goal Navigation via Simultaneous Exploration and Identification
CVPR 2023
DexArt: Benchmarking Generalizable Dexterous Manipulation With Articulated Objects
CVPR 2023
Dynamic Inference With Grounding Based Vision and Language Models
CVPR 2023
Trace and Pace: Controllable Pedestrian Animation via Guided Trajectory Diffusion
CVPR 2023
Co-Speech Gesture Synthesis by Reinforcement Learning With Contrastive Pre-Trained Rewards
CVPR 2023
PIRLNav: Pretraining With Imitation and RL Finetuning for ObjectNav
CVPR 2023
EXCALIBUR: Encouraging and Evaluating Embodied Exploration
CVPR 2023
Learning Human-to-Robot Handovers From Point Clouds
CVPR 2023
Discovering Object-Centric Generalized Value Functions From Pixels
ICML 2023
Representation Learning with Multi-Step Inverse Kinematics: An Efficient and Optimal Approach to Rich-Observation RL
ICML 2023
Understanding Plasticity in Neural Networks
ICML 2023