conftrace_

Reinforcement Learning › Methods ›

Policy Learning

2,076 papers

Papers per year

6

1

1

11

10

14

9

23

15

25

25

24

23

27

61

107

187

216

274

259

321

247

153

37

'10

'15

'20

'25

Papers

Dr Jekyll & Mr Hyde: the strange case of off-policy policy updates NIPS 2021

Learning Barrier Certificates: Towards Safe Reinforcement Learning with Zero Training-time Violations NIPS 2021

Reward is enough for convex MDPs NIPS 2021

Navigating to the Best Policy in Markov Decision Processes NIPS 2021

Robust Inverse Reinforcement Learning under Transition Dynamics Mismatch NIPS 2021

Coordinated Proximal Policy Optimization NIPS 2021

Is Bang-Bang Control All You Need? Solving Continuous Control with Bernoulli Policies NIPS 2021

Policy Finetuning: Bridging Sample-Efficient Offline and Online Reinforcement Learning NIPS 2021

Robust Predictable Control NIPS 2021

MobILE: Model-Based Imitation Learning From Observation Alone NIPS 2021

Stabilizing Dynamical Systems via Policy Gradient Methods NIPS 2021

Model-Based Episodic Memory Induces Dynamic Hybrid Controls NIPS 2021

Implicit Behavioral Cloning CORL 2021

Learning Feasibility to Imitate Demonstrators with Different Dynamics CORL 2021

XIRL: Cross-embodiment Inverse Reinforcement Learning CORL 2021

ThriftyDAgger: Budget-Aware Novelty and Risk Gating for Interactive Imitation Learning CORL 2021

A Constrained Multi-Objective Reinforcement Learning Framework CORL 2021

Redundancy Resolution as Action Bias in Policy Search for Robotic Manipulation CORL 2021

Specializing Versatile Skill Libraries using Local Mixture of Experts CORL 2021

SCAPE: Learning Stiffness Control from Augmented Position Control Experiences CORL 2021

You Only Evaluate Once: a Simple Baseline Algorithm for Offline RL CORL 2021

"Good Robot! Now Watch This!": Repurposing Reinforcement Learning for Task-to-Task Transfer CORL 2021

Commission Fee is not Enough: A Hierarchical Reinforced Framework for Portfolio Management AAAI 2021

Learning to Recommend from Sparse Data via Generative User Feedback AAAI 2021

Hierarchical Reinforcement Learning for Integrated Recommendation AAAI 2021