conftrace
_
Papers
Trends
Conferences
Explore
Authors
Papers
Trends
Conferences
Explore
Authors
Topics
Keywords
Achievements
← Methods
Reinforcement Learning
›
Methods
›
Deep RL
3,861 papers
Papers per year
2005: 1
2006: 9
2007: 14
2008: 15
2009: 9
2010: 21
2011: 27
2012: 32
2013: 21
2014: 17
2015: 10
2016: 33
2017: 102
2018: 222
2019: 399
2020: 450
2021: 533
2022: 478
2023: 532
2024: 513
2025: 326
2026: 97
Papers
Meta-Reinforcement Learning for Mastering Multiple Skills and Generalizing across Environments in Text-based Games
IJCNLP 2021
Offline Reinforcement Learning from Human Feedback in Real-World Sequence-to-Sequence Tasks
IJCNLP 2021
Variable Frame Rate Acoustic Models Using Minimum Error Reinforcement Learning
INTERSPEECH 2021
Reinforcement Learning for Emotional Text-to-Speech Synthesis with Improved Emotion Discriminability
INTERSPEECH 2021
Learning and Planning for Time-Varying MDPs Using Maximum Likelihood Estimation
JMLR 2021
Risk-Averse Learning by Temporal Difference Methods with Markov Risk Measures
JMLR 2021
ChainerRL: A Deep Reinforcement Learning Library
JMLR 2021
Safe Policy Iteration: A Monotonically Improving Approximate Policy Iteration Approach
JMLR 2021
MushroomRL: Simplifying Reinforcement Learning Research
JMLR 2021
Hamilton-Jacobi Deep Q-Learning for Deterministic Continuous-Time Systems with Lipschitz Continuous Controls
JMLR 2021
Stable-Baselines3: Reliable Reinforcement Learning Implementations
JMLR 2021
Gaussian Approximation for Bias Reduction in Q-Learning
JMLR 2021
VariBAD: Variational Bayes-Adaptive Deep RL via Meta-Learning
JMLR 2021
On the Model-Based Stochastic Value Gradient for Continuous Reinforcement Learning
L4DC 2021
Estimating Disentangled Belief about Hidden State and Hidden Task for Meta-Reinforcement Learning
L4DC 2021
Learning to Actively Reduce Memory Requirements for Robot Control Tasks
L4DC 2021
Abstraction-based branch and bound approach to Q-learning for hybrid optimal control
L4DC 2021
Safe Reinforcement Learning of Control-Affine Systems with Vertex Networks
L4DC 2021
LEOC: A Principled Method in Integrating Reinforcement Learning and Classical Control Theory
L4DC 2021
Nested Mixture of Experts: Cooperative and Competitive Learning of Hybrid Dynamical System
L4DC 2021
Learning without Knowing: Unobserved Context in Continuous Transfer Reinforcement Learning
L4DC 2021
Reward Biased Maximum Likelihood Estimation for Reinforcement Learning
L4DC 2021
How Are Learned Perception-Based Controllers Impacted by the Limits of Robust Control?
L4DC 2021
Faster Policy Learning with Continuous-Time Gradients
L4DC 2021
Safe Reinforcement Learning Using Robust Action Governor
L4DC 2021
<
1
…
97
98
99
…
155
>