Research Explorer
Papers
Trends
Conferences
Explore
Authors
Topics
Keywords
Papers
Trends
Conferences
Explore
Authors
Topics
Keywords
Achievements
About
Methodology
← Methods
Reinforcement Learning
›
Methods
›
Deep RL
3861 directly classified papers
Papers per year
2005: 1
2006: 9
2007: 14
2008: 15
2009: 9
2010: 21
2011: 27
2012: 32
2013: 21
2014: 17
2015: 10
2016: 33
2017: 102
2018: 222
2019: 399
2020: 450
2021: 533
2022: 478
2023: 532
2024: 513
2025: 326
2026: 97
Papers
Regret Bounds for Risk-sensitive Reinforcement Learning with Lipschitz Dynamic Risk Measures
AISTATS 2024
KEHRL: Learning Knowledge-Enhanced Language Representations with Hierarchical Reinforcement Learning
COLING 2024
Sample Complexity Characterization for Linear Contextual MDPs
AISTATS 2024
Generalizable Policy Improvement via Reinforcement Sampling (Student Abstract)
AAAI 2024
Feasible $Q$-Learning for Average Reward Reinforcement Learning
AISTATS 2024
PopALM: Popularity-Aligned Language Models for Social Media Trendy Response Prediction
COLING 2024
Reinforcement Learning for Edit-Based Non-Autoregressive Neural Machine Translation
NAACL 2024
A Behavior-Aware Approach for Deep Reinforcement Learning in Non-stationary Environments without Known Change Points
IJCAI 2024
Off-Policy Action Anticipation in Multi-Agent Reinforcement Learning
JMLR 2024
Virtual Action Actor-Critic Framework for Exploration (Student Abstract)
AAAI 2024
Isometric Neural Machine Translation using Phoneme Count Ratio Reward-based Reinforcement Learning
NAACL 2024
Central Limit Theorem for Two-Timescale Stochastic Approximation with Markovian Noise: Theory and Applications
AISTATS 2024
An Analysis of Quantile Temporal-Difference Learning
JMLR 2024
Model-Free Representation Learning and Exploration in Low-Rank MDPs
JMLR 2024
Learning Sampling Policy to Achieve Fewer Queries for Zeroth-Order Optimization
AISTATS 2024
Integrating Neural Pathways for Learning in Deep Reinforcement Learning Models
AAAI 2024
On the Sample Complexity and Metastability of Heavy-tailed Policy Search in Continuous Control
JMLR 2024
MANDREL: Modular Reinforcement Learning Pipelines for Material Discovery
AAAI 2024
Multi-world Model in Continual Reinforcement Learning
AAAI 2024
Sample Complexity of Neural Policy Mirror Descent for Policy Optimization on Low-Dimensional Manifolds
JMLR 2024
Towards Achieving Sub-linear Regret and Hard Constraint Violation in Model-free RL
AISTATS 2024
Deep Reinforcement Learning with Hierarchical Action Exploration for Dialogue Generation
COLING 2024
Actor Prioritized Experience Replay (Abstract Reprint)
AAAI 2024
Prior-dependent analysis of posterior sampling reinforcement learning with function approximation
AISTATS 2024
Value-Distributional Model-Based Reinforcement Learning
JMLR 2024
<
1
…
19
20
21
…
155
>