Research Explorer
Reinforcement Learning
2932 directly classified papers
Papers per year
2003: 1
2006: 11
2007: 18
2008: 23
2009: 14
2010: 22
2011: 24
2012: 34
2013: 26
2014: 24
2015: 14
2016: 23
2017: 79
2018: 182
2019: 255
2020: 284
2021: 333
2022: 319
2023: 315
2024: 457
2025: 419
2026: 55
Papers
Exploiting Action Impact Regularity and Exogenous State Variables for Offline Reinforcement Learning (Abstract Reprint)
AAAI 2024
When LLM Meets DRL: Advancing Jailbreaking Efficiency via DRL-guided Search
NIPS 2024
AlphaMath Almost Zero: Process Supervision without Process
NIPS 2024
MANDREL: Modular Reinforcement Learning Pipelines for Material Discovery
AAAI 2024
A2PO: Towards Effective Offline Reinforcement Learning from an Advantage-aware Perspective
NIPS 2024
Scaling Offline Evaluation of Reinforcement Learning Agents through Abstraction
AAAI 2024
Model-free Low-Rank Reinforcement Learning via Leveraged Entry-wise Matrix Estimation
NIPS 2024
Reward (Mis)design for Autonomous Driving (Abstract Reprint)
AAAI 2024
Sustainability of Data Center Digital Twins with Reinforcement Learning
AAAI 2024
Time-Constrained Robust MDPs
NIPS 2024
Interpreting Learned Feedback Patterns in Large Language Models
NIPS 2024
Responsible Bandit Learning via Privacy-Protected Mean-Volatility Utility
AAAI 2024
Amortized Active Causal Induction with Deep Reinforcement Learning
NIPS 2024
Safe Reinforcement Learning with Instantaneous Constraints: The Role of Aggressive Exploration
AAAI 2024
Action Gaps and Advantages in Continuous-Time Distributional Reinforcement Learning
NIPS 2024
Online Control with Adversarial Disturbance for Continuous-time Linear Systems
NIPS 2024
Learning Successor Features the Simple Way
NIPS 2024
Predicting Future Actions of Reinforcement Learning Agents
NIPS 2024
Doubly Mild Generalization for Offline Reinforcement Learning
NIPS 2024
REBEL: Reinforcement Learning via Regressing Relative Rewards
NIPS 2024
Personalizing Reinforcement Learning from Human Feedback with Variational Preference Learning
NIPS 2024
Toward Self-Improvement of LLMs via Imagination, Searching, and Criticizing
NIPS 2024
Adversarially Trained Weighted Actor-Critic for Safe Offline Reinforcement Learning
NIPS 2024
Diffusion-based Reinforcement Learning via Q-weighted Variational Policy Optimization
NIPS 2024
Enhancing Off-Policy Constrained Reinforcement Learning through Adaptive Ensemble C Estimation
AAAI 2024