Research Explorer
Papers
Trends
Conferences
Explore
Authors
Topics
Keywords
Papers
Trends
Conferences
Explore
Authors
Topics
Keywords
Achievements
About
Methodology
← Learning Types
Machine Learning
›
Learning Types
›
Reinforcement Learning
2932 directly classified papers
Papers per year
2003: 1
2006: 11
2007: 18
2008: 23
2009: 14
2010: 22
2011: 24
2012: 34
2013: 26
2014: 24
2015: 14
2016: 23
2017: 79
2018: 182
2019: 255
2020: 284
2021: 333
2022: 319
2023: 315
2024: 457
2025: 419
2026: 55
Papers
Evidential Stochastic Differential Equations for Time-Aware Sequential Recommendation
NIPS 2024
Improving Context-Aware Preference Modeling for Language Models
NIPS 2024
Sample-Efficient Constrained Reinforcement Learning with General Parameterization
NIPS 2024
Fine Tuning Out-of-Vocabulary Item Recommendation with User Sequence Imagination
NIPS 2024
No Representation, No Trust: Connecting Representation, Collapse, and Trust Issues in PPO
NIPS 2024
The Sample-Communication Complexity Trade-off in Federated Q-Learning
NIPS 2024
RL on Incorrect Synthetic Data Scales the Efficiency of LLM Math Reasoning by Eight-Fold
NIPS 2024
Enhancing Chess Reinforcement Learning with Graph Representation
NIPS 2024
Adaptive Labeling for Efficient Out-of-distribution Model Evaluation
NIPS 2024
Near-Optimal Distributionally Robust Reinforcement Learning with General $L_p$ Norms
NIPS 2024
Taming "data-hungry" reinforcement learning? Stability in continuous state-action spaces
NIPS 2024
Risk-sensitive control as inference with Rényi divergence
NIPS 2024
State Chrono Representation for Enhancing Generalization in Reinforcement Learning
NIPS 2024
Sustainability of Data Center Digital Twins with Reinforcement Learning
AAAI 2024
Learning to Navigate Efficiently and Precisely in Real Environments
CVPR 2024
NeoRL: Efficient Exploration for Nonepisodic RL
NIPS 2024
Exploration by Learning Diverse Skills through Successor State Representations
NIPS 2024
AlignSAM: Aligning Segment Anything Model to Open Context via Reinforcement Learning
CVPR 2024
On Divergence Measures for Training GFlowNets
NIPS 2024
Carve3D: Improving Multi-view Reconstruction Consistency for Diffusion Models with RL Finetuning
CVPR 2024
Amortizing intractable inference in diffusion models for vision, language, and control
NIPS 2024
Let Me Teach You: Pedagogical Foundations of Feedback for Language Models
EMNLP 2024
ORPO: Monolithic Preference Optimization without Reference Model
EMNLP 2024
Is Value Learning Really the Main Bottleneck in Offline RL?
NIPS 2024
BPO: Staying Close to the Behavior LLM Creates Better Online LLM Alignment
EMNLP 2024
<
1
…
33
34
35
…
118
>