Research Explorer
Deep RL
3861 directly classified papers
Papers per year
2005: 1
2006: 9
2007: 14
2008: 15
2009: 9
2010: 21
2011: 27
2012: 32
2013: 21
2014: 17
2015: 10
2016: 33
2017: 102
2018: 222
2019: 399
2020: 450
2021: 533
2022: 478
2023: 532
2024: 513
2025: 326
2026: 97
Papers
An Analytical Study of Utility Functions in Multi-Objective Reinforcement Learning
NIPS 2024
SkiLD: Unsupervised Skill Discovery Guided by Factor Interactions
NIPS 2024
Two-way Deconfounder for Off-policy Evaluation in Causal Reinforcement Learning
NIPS 2024
Learning to Discuss Strategically: A Case Study on One Night Ultimate Werewolf
NIPS 2024
RL in Latent MDPs is Tractable: Online Guarantees via Off-Policy Evaluation
NIPS 2024
Reinforcement Learning for Edit-Based Non-Autoregressive Neural Machine Translation
NAACL 2024
Bit_numeval at SemEval-2024 Task 7: Enhance Numerical Sensitivity and Reasoning Completeness for Quantitative Understanding
NAACL 2024
Randomized Exploration in Cooperative Multi-Agent Reinforcement Learning
NIPS 2024
Compositional Automata Embeddings for Goal-Conditioned Reinforcement Learning
NIPS 2024
Local Linearity: the Key for No-regret Reinforcement Learning in Continuous MDPs
NIPS 2024
Taming "data-hungry" reinforcement learning? Stability in continuous state-action spaces
NIPS 2024
Speculative Monte-Carlo Tree Search
NIPS 2024
Learning to Assist Humans without Inferring Rewards
NIPS 2024
Disentangled Unsupervised Skill Discovery for Efficient Hierarchical Reinforcement Learning
NIPS 2024
Flipping-based Policy for Chance-Constrained Markov Decision Processes
NIPS 2024
ReST-MCTS*: LLM Self-Training via Process Reward Guided Tree Search
NIPS 2024
Provably and Practically Efficient Adversarial Imitation Learning with General Function Approximation
NIPS 2024
OptEx: Expediting First-Order Optimization with Approximately Parallelized Iterations
NIPS 2024
Mitigating Partial Observability in Sequential Decision Processes via the Lambda Discrepancy
NIPS 2024
Randomized Exploration for Reinforcement Learning with Multinomial Logistic Function Approximation
NIPS 2024
Towards a Zero-Data, Controllable, Adaptive Dialog System
COLING 2024
An Approach towards Unsupervised Text Simplification on Paragraph-Level for German Texts
COLING 2024
KEHRL: Learning Knowledge-Enhanced Language Representations with Hierarchical Reinforcement Learning
COLING 2024
Regularizing Hidden States Enables Learning Generalizable Reward Model for LLMs
NIPS 2024
PopALM: Popularity-Aligned Language Models for Social Media Trendy Response Prediction
COLING 2024