Research Explorer
Papers
Trends
Conferences
Explore
Authors
Topics
Keywords
Papers
Trends
Conferences
Explore
Authors
Topics
Keywords
Achievements
About
Methodology
← Learning Types
Machine Learning
›
Learning Types
›
Reinforcement Learning
2932 directly classified papers
Papers per year
2003: 1
2006: 11
2007: 18
2008: 23
2009: 14
2010: 22
2011: 24
2012: 34
2013: 26
2014: 24
2015: 14
2016: 23
2017: 79
2018: 182
2019: 255
2020: 284
2021: 333
2022: 319
2023: 315
2024: 457
2025: 419
2026: 55
Papers
Unsupervised Object Interaction Learning with Counterfactual Dynamics Models
AAAI 2024
Beyond OOD State Actions: Supported Cross-Domain Offline Reinforcement Learning
AAAI 2024
Hierarchical Planning and Learning for Robots in Stochastic Settings Using Zero-Shot Option Invention
AAAI 2024
Sample Efficient Reinforcement Learning with Partial Dynamics Knowledge
AAAI 2024
OVD-Explorer: Optimism Should Not Be the Sole Pursuit of Exploration in Noisy Environments
AAAI 2024
Distributionally Robust Reinforcement Learning with Interactive Data Collection: Fundamental Hardness and Near-Optimal Algorithms
NIPS 2024
Distributionally Robust Model-Based Offline Reinforcement Learning with Near-Optimal Sample Complexity
JMLR 2024
Bridging Distributional and Risk-sensitive Reinforcement Learning with Provable Regret Bounds
JMLR 2024
A Reinforcement-Learning-Based Multiple-Column Selection Strategy for Column Generation
AAAI 2024
Online Markov Decision Processes Configuration with Continuous Decision Space
AAAI 2024
Generalizable Task Representation Learning for Offline Meta-Reinforcement Learning with Data Limitations
AAAI 2024
Settling Decentralized Multi-Agent Coordinated Exploration by Novelty Sharing
AAAI 2024
DiffPhyCon: A Generative Approach to Control Complex Physical Systems
NIPS 2024
When Your AIs Deceive You: Challenges of Partial Observability in Reinforcement Learning from Human Feedback
NIPS 2024
RS-DPO: A Hybrid Rejection Sampling and Direct Preference Optimization Method for Alignment of Large Language Models
NAACL 2024
A Nearly Optimal and Low-Switching Algorithm for Reinforcement Learning with General Function Approximation
NIPS 2024
Deterministic Policies for Constrained Reinforcement Learning in Polynomial Time
NIPS 2024
Log Barriers for Safe Black-box Optimization with Application to Safe Reinforcement Learning
JMLR 2024
Reinforcement Learning with Adaptive Regularization for Safe Control of Critical Systems
NIPS 2024
Controlling Character Motions Without Observable Driving Source
WACV 2024
Imitating Language via Scalable Inverse Reinforcement Learning
NIPS 2024
Decentralized Natural Policy Gradient with Variance Reduction for Collaborative Multi-Agent Reinforcement Learning
JMLR 2024
Graph Diffusion Policy Optimization
NIPS 2024
Causal Imitation for Markov Decision Processes: a Partial Identification Approach
NIPS 2024
Speculative Monte-Carlo Tree Search
NIPS 2024
<
1
…
20
21
22
…
118
>