Research Explorer
Papers
Trends
Conferences
Explore
Authors
Topics
Keywords
Papers
Trends
Conferences
Explore
Authors
Topics
Keywords
Achievements
About
Methodology
← Learning Types
Machine Learning
›
Learning Types
›
Reinforcement Learning
2932 directly classified papers
Papers per year
2003: 1
2006: 11
2007: 18
2008: 23
2009: 14
2010: 22
2011: 24
2012: 34
2013: 26
2014: 24
2015: 14
2016: 23
2017: 79
2018: 182
2019: 255
2020: 284
2021: 333
2022: 319
2023: 315
2024: 457
2025: 419
2026: 55
Papers
Unveiling User Satisfaction and Creator Productivity Trade-Offs in Recommendation Platforms
NIPS 2024
FactorSim: Generative Simulation via Factorized Representation
NIPS 2024
RGMDT: Return-Gap-Minimizing Decision Tree Extraction in Non-Euclidean Metric Space
NIPS 2024
Thompson Sampling Itself is Differentially Private
AISTATS 2024
Learning to Stop Cut Generation for Efficient Mixed-Integer Linear Programming
AAAI 2024
Enhancing Off-Policy Constrained Reinforcement Learning through Adaptive Ensemble C Estimation
AAAI 2024
MANDREL: Modular Reinforcement Learning Pipelines for Material Discovery
AAAI 2024
ConsistentEE: A Consistent and Hardness-Guided Early Exiting Method for Accelerating Language Models Inference
AAAI 2024
Aligner²: Enhancing Joint Multiple Intent Detection and Slot Filling via Adjustive and Forced Cross-Task Alignment
AAAI 2024
On Divergence Measures for Training GFlowNets
NIPS 2024
Dialogue for Prompting: A Policy-Gradient-Based Discrete Prompt Generation for Few-Shot Learning
AAAI 2024
Reward Penalties on Augmented States for Solving Richly Constrained RL Effectively
AAAI 2024
Local Linearity: the Key for No-regret Reinforcement Learning in Continuous MDPs
NIPS 2024
CryoRL: Reinforcement Learning Enables Efficient Cryo-EM Data Collection
WACV 2024
Optimizing Local Satisfaction of Long-Run Average Objectives in Markov Decision Processes
AAAI 2024
Stage-Aware Learning for Dynamic Treatments
JMLR 2024
RL in Latent MDPs is Tractable: Online Guarantees via Off-Policy Evaluation
NIPS 2024
Towards an Information Theoretic Framework of Context-Based Offline Meta-Reinforcement Learning
NIPS 2024
Policy Learning from Tutorial Books via Understanding, Rehearsing and Introspecting
NIPS 2024
AlphaMath Almost Zero: Process Supervision without Process
NIPS 2024
Response Enhanced Semi-supervised Dialogue Query Generation
AAAI 2024
Probabilistic Offline Policy Ranking with Approximate Bayesian Computation
AAAI 2024
Risk-sensitive control as inference with Rényi divergence
NIPS 2024
State Chrono Representation for Enhancing Generalization in Reinforcement Learning
NIPS 2024
Causal Imitation for Markov Decision Processes: a Partial Identification Approach
NIPS 2024
<
1
…
26
27
28
…
118
>