conftrace
_
Papers
Trends
Conferences
Explore
Authors
Papers
Trends
Conferences
Explore
Authors
Topics
Keywords
Achievements
← Methods
Reinforcement Learning
›
Methods
›
Deep RL
3,861 papers
Papers per year
2005: 1
2006: 9
2007: 14
2008: 15
2009: 9
2010: 21
2011: 27
2012: 32
2013: 21
2014: 17
2015: 10
2016: 33
2017: 102
2018: 222
2019: 399
2020: 450
2021: 533
2022: 478
2023: 532
2024: 513
2025: 326
2026: 97
Papers
SEAGuL: Sample Efficient Adversarially Guided Learning of Value Functions
L4DC 2021
Fast Stochastic Kalman Gradient Descent for Reinforcement Learning
L4DC 2021
Robust Reinforcement Learning: A Constrained Game-theoretic Approach
L4DC 2021
Revisiting the Weaknesses of Reinforcement Learning for Neural Machine Translation
NAACL 2021
Imperfect also Deserves Reward: Multi-Level and Sequential Reward Modeling for Better Dialog Management
NAACL 2021
ER-AE: Differentially Private Text Generation for Authorship Anonymization
NAACL 2021
Quantitative Day Trading from Natural Language using Reinforcement Learning
NAACL 2021
Ad Headline Generation using Self-Critical Masked Language Model
NAACL 2021
Alohamora: Reviving HTTP/2 Push and Preload by Adapting Policies On the Fly
NSDI 2021
SENSEI: Aligning Video Streaming Quality with Dynamic User Sensitivity
NSDI 2021
One Protocol to Rule Them All: Wireless Network-on-Chip using Deep Reinforcement Learning
NSDI 2021
Hierarchical Neural Dynamic Policies
RSS 2021
MAGIC: Learning Macro-Actions for Online POMDP Planning
RSS 2021
Composable Energy Policies for Reactive Motion Generation and Reinforcement Learning
RSS 2021
Blind Bipedal Stair Traversal via Sim-to-Real Reinforcement Learning
RSS 2021
HJB-RL: Initializing Reinforcement Learning with Optimal Control Policies Applied to Autonomous Drone Racing
RSS 2021
Robust Multi-Modal Policies for Industrial Assembly via Reinforcement Learning and Demonstrations: A Large-Scale Study
RSS 2021
Formal verification of neural networks for safety-critical tasks in deep reinforcement learning
UAI 2021
Action redundancy in reinforcement learning
UAI 2021
Escaping from zero gradient: Revisiting action-constrained reinforcement learning via Frank-Wolfe policy optimization
UAI 2021
Unsupervised program synthesis for images by sampling without replacement
UAI 2021
Finite-time theory for momentum Q-learning
UAI 2021
Towards tractable optimism in model-based reinforcement learning
UAI 2021
Investigating vulnerabilities of deep neural policies
UAI 2021
Contextual policy transfer in reinforcement learning domains via deep mixtures-of-experts
UAI 2021
<
1
…
98
99
100
…
155
>