conftrace_

Deep RL

3,861 papers

Papers

Meta-Reinforcement Learning for Mastering Multiple Skills and Generalizing across Environments in Text-based Games IJCNLP 2021

Offline Reinforcement Learning from Human Feedback in Real-World Sequence-to-Sequence Tasks IJCNLP 2021

Variable Frame Rate Acoustic Models Using Minimum Error Reinforcement Learning INTERSPEECH 2021

Reinforcement Learning for Emotional Text-to-Speech Synthesis with Improved Emotion Discriminability INTERSPEECH 2021

Learning and Planning for Time-Varying MDPs Using Maximum Likelihood Estimation JMLR 2021

Risk-Averse Learning by Temporal Difference Methods with Markov Risk Measures JMLR 2021

ChainerRL: A Deep Reinforcement Learning Library JMLR 2021

Safe Policy Iteration: A Monotonically Improving Approximate Policy Iteration Approach JMLR 2021

MushroomRL: Simplifying Reinforcement Learning Research JMLR 2021

Hamilton-Jacobi Deep Q-Learning for Deterministic Continuous-Time Systems with Lipschitz Continuous Controls JMLR 2021

Stable-Baselines3: Reliable Reinforcement Learning Implementations JMLR 2021

Gaussian Approximation for Bias Reduction in Q-Learning JMLR 2021

VariBAD: Variational Bayes-Adaptive Deep RL via Meta-Learning JMLR 2021

On the Model-Based Stochastic Value Gradient for Continuous Reinforcement Learning L4DC 2021

Estimating Disentangled Belief about Hidden State and Hidden Task for Meta-Reinforcement Learning L4DC 2021

Learning to Actively Reduce Memory Requirements for Robot Control Tasks L4DC 2021

Abstraction-based branch and bound approach to Q-learning for hybrid optimal control L4DC 2021

Safe Reinforcement Learning of Control-Affine Systems with Vertex Networks L4DC 2021

LEOC: A Principled Method in Integrating Reinforcement Learning and Classical Control Theory L4DC 2021

Nested Mixture of Experts: Cooperative and Competitive Learning of Hybrid Dynamical System L4DC 2021

Learning without Knowing: Unobserved Context in Continuous Transfer Reinforcement Learning L4DC 2021

Reward Biased Maximum Likelihood Estimation for Reinforcement Learning L4DC 2021

How Are Learned Perception-Based Controllers Impacted by the Limits of Robust Control? L4DC 2021

Faster Policy Learning with Continuous-Time Gradients L4DC 2021

Safe Reinforcement Learning Using Robust Action Governor L4DC 2021