Research Explorer
Papers
Trends
Conferences
Explore
Authors
Topics
Keywords
Papers
Trends
Conferences
Explore
Authors
Topics
Keywords
Achievements
About
Methodology
← Methods
Reinforcement Learning
›
Methods
›
Offline RL
725 directly classified papers
Papers per year
2007: 2
2008: 1
2009: 1
2011: 1
2012: 2
2014: 3
2015: 2
2016: 6
2017: 4
2018: 8
2019: 29
2020: 60
2021: 105
2022: 129
2023: 187
2024: 126
2025: 37
2026: 22
Papers
Leveraging Factored Action Spaces for Efficient Offline Reinforcement Learning in Healthcare
NIPS 2022
Provably Efficient Offline Multi-agent Reinforcement Learning via Strategy-wise Bonus
NIPS 2022
Grounding Aleatoric Uncertainty for Unsupervised Environment Design
NIPS 2022
Robust Reinforcement Learning using Offline Data
NIPS 2022
IMO^3: Interactive Multi-Objective Off-Policy Optimization
IJCAI 2022
Model-Based Offline Planning with Trajectory Pruning
IJCAI 2022
Supported Policy Optimization for Offline Reinforcement Learning
NIPS 2022
On the Effect of Pre-training for Transformer in Different Modality on Offline Reinforcement Learning
NIPS 2022
Off-Policy Evaluation with Deficient Support Using Side Information
NIPS 2022
Oracle Inequalities for Model Selection in Offline Reinforcement Learning
NIPS 2022
When are Offline Two-Player Zero-Sum Markov Games Solvable?
NIPS 2022
Improving Zero-Shot Generalization in Offline Reinforcement Learning using Generalized Similarity Functions
NIPS 2022
Bayesian Nonparametrics for Offline Skill Discovery
ICML 2022
Addressing Optimism Bias in Sequence Modeling for Reinforcement Learning
ICML 2022
Interpretable Off-Policy Learning via Hyperbox Search
ICML 2022
Pessimistic Q-Learning for Offline Reinforcement Learning: Towards Optimal Sample Complexity
ICML 2022
Offline Meta-Reinforcement Learning with Online Self-Supervision
ICML 2022
Constrained Offline Policy Optimization
ICML 2022
Plan Better Amid Conservatism: Offline Multi-Agent Reinforcement Learning with Actor Rectification
ICML 2022
Versatile Offline Imitation from Observations and Examples via Regularized State-Occupancy Matching
ICML 2022
Pessimism meets VCG: Learning Dynamic Mechanism Design via Offline Reinforcement Learning
ICML 2022
Phasic Self-Imitative Reduction for Sparse-Reward Goal-Conditioned Reinforcement Learning
ICML 2022
Model Selection in Batch Policy Optimization
ICML 2022
Showing Your Offline Reinforcement Learning Work: Online Evaluation Budget Matters
ICML 2022
Doubly Robust Distributionally Robust Off-Policy Evaluation and Learning
ICML 2022
<
1
…
15
16
17
…
29
>