Research Explorer
Papers
Trends
Conferences
Explore
Authors
Topics
Keywords
Papers
Trends
Conferences
Explore
Authors
Topics
Keywords
Achievements
About
Methodology
← Learning Types
Machine Learning
›
Learning Types
›
Reinforcement Learning
2932 directly classified papers
Papers per year
2003: 1
2006: 11
2007: 18
2008: 23
2009: 14
2010: 22
2011: 24
2012: 34
2013: 26
2014: 24
2015: 14
2016: 23
2017: 79
2018: 182
2019: 255
2020: 284
2021: 333
2022: 319
2023: 315
2024: 457
2025: 419
2026: 55
Papers
QGFN: Controllable Greediness with Action Values
NIPS 2024
Responsible Bandit Learning via Privacy-Protected Mean-Volatility Utility
AAAI 2024
Can You Rely on Synthetic Labellers in Preference-Based Reinforcement Learning? It’s Complicated
AAAI 2024
Learning Optimal Advantage from Preferences and Mistaking It for Reward
AAAI 2024
Variational Delayed Policy Optimization
NIPS 2024
WorldCoder, a Model-Based LLM Agent: Building World Models by Writing Code and Interacting with the Environment
NIPS 2024
Explaining Reinforcement Learning Agents through Counterfactual Action Outcomes
AAAI 2024
Overcoming the Sim-to-Real Gap: Leveraging Simulation to Learn to Explore for Real-World RL
NIPS 2024
A/B Testing and Best-arm Identification for Linear Bandits with Robustness to Non-stationarity
AISTATS 2024
Automated Design of Affine Maximizer Mechanisms in Dynamic Settings
AAAI 2024
Sequential Decision Making with Expert Demonstrations under Unobserved Heterogeneity
NIPS 2024
Aligning Vision Models with Human Aesthetics in Retrieval: Benchmarks and Algorithms
NIPS 2024
Rethinking Discount Regularization: New Interpretations, Unintended Consequences, and Solutions for Regularization in Reinforcement Learning
JMLR 2024
Active Reinforcement Learning for Robust Building Control
AAAI 2024
Reward (Mis)design for Autonomous Driving (Abstract Reprint)
AAAI 2024
Model-Free Representation Learning and Exploration in Low-Rank MDPs
JMLR 2024
Learning from Ambiguous Demonstrations with Self-Explanation Guided Reinforcement Learning
AAAI 2024
Learning Encodings for Constructive Neural Combinatorial Optimization Needs to Regret
AAAI 2024
Task Planning for Object Rearrangement in Multi-Room Environments
AAAI 2024
Effect-Invariant Mechanisms for Policy Generalization
JMLR 2024
Constrained Meta-Reinforcement Learning for Adaptable Safety Guarantee with Differentiable Convex Programming
AAAI 2024
Probabilistic Offline Policy Ranking with Approximate Bayesian Computation
AAAI 2024
DiG-In-GNN: Discriminative Feature Guided GNN-Based Fraud Detector against Inconsistencies in Multi-Relation Fraud Graph
AAAI 2024
Limited Query Graph Connectivity Test
AAAI 2024
UNEX-RL: Reinforcing Long-Term Rewards in Multi-Stage Recommender Systems with UNidirectional EXecution
AAAI 2024
<
1
…
24
25
26
…
118
>