conftrace_

Reinforcement Learning › Methods ›

Deep RL

3,886 papers

Papers per year

1

9

14

15

9

21

27

32

21

17

10

33

102

222

399

450

533

478

532

513

326

122

'05

'10

'15

'20

'25

Papers

Efficient Reinforcement Learning for High Dimensional Linear Quadratic Systems NIPS 2012

Value Pursuit Iteration NIPS 2012

Sketch-Based Linear Value Function Approximation NIPS 2012

Tractable Objectives for Robust Policy Optimization NIPS 2012

Robustness and risk-sensitivity in Markov decision processes NIPS 2012

Risk Aversion in Markov Decision Processes via Near Optimal Chernoff Bounds NIPS 2012

Online Regret Bounds for Undiscounted Continuous Reinforcement Learning NIPS 2012

Weighted Likelihood Policy Search with Model Selection NIPS 2012

Multi-objective Monte-Carlo Tree Search ACML 2012

Contextual Bandit Learning with Predictable Rewards AISTATS 2012

Optimistic planning for Markov decision processes AISTATS 2012

Exploration in Relational Domains for Model-based Reinforcement Learning JMLR 2012

Reducing Conservativeness in Safety Guarantees by Learning Disturbances Online: Iterated Guaranteed Safe Online Learning RSS 2012

On Stochastic Optimal Control and Reinforcement Learning by Approximate Inference RSS 2012

Tendon-Driven Variable Impedance Control Using Reinforcement Learning RSS 2012

Action-Gap Phenomenon in Reinforcement Learning NIPS 2011

Policy Gradient Coagent Networks NIPS 2011

A Non-Parametric Approach to Dynamic Programming NIPS 2011

Convergent Fitted Value Iteration with Linear Function Approximation NIPS 2011

Analysis and Improvement of Policy Gradient Estimation NIPS 2011

Learning to Agglomerate Superpixel Hierarchies NIPS 2011

Selecting the State-Representation in Reinforcement Learning NIPS 2011

A Reinforcement Learning Theory for Homeostatic Regulation NIPS 2011

Speedy Q-Learning NIPS 2011

Reinforcement Learning using Kernel-Based Stochastic Factorization NIPS 2011