Research Explorer
Reinforcement Learning
2932 directly classified papers
Papers per year
2003: 1
2006: 11
2007: 18
2008: 23
2009: 14
2010: 22
2011: 24
2012: 34
2013: 26
2014: 24
2015: 14
2016: 23
2017: 79
2018: 182
2019: 255
2020: 284
2021: 333
2022: 319
2023: 315
2024: 457
2025: 419
2026: 55
Papers
DACO: Towards Application-Driven and Comprehensive Data Analysis via Code Generation
NIPS 2024
QGym: Scalable Simulation and Benchmarking of Queuing Network Controllers
NIPS 2024
Mimicking To Dominate: Imitation Learning Strategies for Success in Multiagent Games
NIPS 2024
Exploiting the Replay Memory Before Exploring the Environment: Enhancing Reinforcement Learning Through Empirical MDP Iteration
NIPS 2024
Uncertainty of Thoughts: Uncertainty-Aware Planning Enhances Information Seeking in LLMs
NIPS 2024
Global Rewards in Restless Multi-Armed Bandits
NIPS 2024
BertRLFuzzer: A BERT and Reinforcement Learning Based Fuzzer (Student Abstract)
AAAI 2024
Spectral-Risk Safe Reinforcement Learning with Convergence Guarantees
NIPS 2024
Solving Minimum-Cost Reach Avoid using Reinforcement Learning
NIPS 2024
Learning the Optimal Policy for Balancing Short-Term and Long-Term Rewards
NIPS 2024
Unveiling the Significance of Toddler-Inspired Reward Transition in Goal-Oriented Reinforcement Learning
AAAI 2024
A Survey on Multi-player Bandits
JMLR 2024
Measuring Mutual Policy Divergence for Multi-Agent Sequential Exploration
NIPS 2024
Learning to Discuss Strategically: A Case Study on One Night Ultimate Werewolf
NIPS 2024
OASIS: Conditional Distribution Shaping for Offline Safe Reinforcement Learning
NIPS 2024
AuctionNet: A Novel Benchmark for Decision-Making in Large-Scale Games
NIPS 2024
Nearest Neighbor Speculative Decoding for LLM Generation and Attribution
NIPS 2024
Two-way Deconfounder for Off-policy Evaluation in Causal Reinforcement Learning
NIPS 2024
Multimodal Large Language Models Make Text-to-Image Generative Models Align Better
NIPS 2024
Trace is the Next AutoDiff: Generative Optimization with Rich Feedback, Execution Traces, and LLMs
NIPS 2024
Disentangled Unsupervised Skill Discovery for Efficient Hierarchical Reinforcement Learning
NIPS 2024
Achieving Stronger Generation via Simple Contrastive Tuning
EMNLP 2024
Randomized Exploration for Reinforcement Learning with Multinomial Logistic Function Approximation
NIPS 2024
The Value of Reward Lookahead in Reinforcement Learning
NIPS 2024
RL-SeqISP: Reinforcement Learning-Based Sequential Optimization for Image Signal Processing
AAAI 2024