Research Explorer
Papers
Trends
Conferences
Explore
Authors
Topics
Keywords
Papers
Trends
Conferences
Explore
Authors
Topics
Keywords
Achievements
About
Methodology
← Methods
Reinforcement Learning
›
Methods
›
Offline RL
725 directly classified papers
Papers per year
2007: 2
2008: 1
2009: 1
2011: 1
2012: 2
2014: 3
2015: 2
2016: 6
2017: 4
2018: 8
2019: 29
2020: 60
2021: 105
2022: 129
2023: 187
2024: 126
2025: 37
2026: 22
Papers
On the Role of Discount Factor in Offline Reinforcement Learning
ICML 2022
Off-Policy Reinforcement Learning with Delayed Rewards
ICML 2022
DeepThermal: Combustion Optimization for Thermal Power Generating Units Using Offline Reinforcement Learning
AAAI 2022
Offline Reinforcement Learning as Anti-exploration
AAAI 2022
Controlling Underestimation Bias in Reinforcement Learning via Quasi-median Operation
AAAI 2022
Towards Off-Policy Learning for Ranking Policies with Logged Feedback
AAAI 2022
State Deviation Correction for Offline Reinforcement Learning
AAAI 2022
Text-Based Interactive Recommendation via Offline Reinforcement Learning
AAAI 2022
Bayesian Model-Based Offline Reinforcement Learning for Product Allocation
AAAI 2022
NeoRL: A Near Real-World Benchmark for Offline Reinforcement Learning
NIPS 2022
Bellman Residual Orthogonalization for Offline Reinforcement Learning
NIPS 2022
RORL: Robust Offline Reinforcement Learning via Conservative Smoothing
NIPS 2022
Large-Scale Retrieval for Reinforcement Learning
NIPS 2022
Uncertainty Estimation Using Riemannian Model Dynamics for Offline Reinforcement Learning
NIPS 2022
Why So Pessimistic? Estimating Uncertainties for Offline RL through Ensembles, and Why Their Independence Matters
NIPS 2022
A Unified Framework for Alternating Offline Model Training and Policy Learning
NIPS 2022
RAMBO-RL: Robust Adversarial Model-Based Offline Reinforcement Learning
NIPS 2022
On Gap-dependent Bounds for Offline Reinforcement Learning
NIPS 2022
Data-Efficient Pipeline for Offline Reinforcement Learning with Limited Data
NIPS 2022
Max-Min Off-Policy Actor-Critic Method Focusing on Worst-Case Robustness to Model Misspecification
NIPS 2022
A Closer Look at Offline RL Agents
NIPS 2022
Off-Policy Evaluation for Action-Dependent Non-stationary Environments
NIPS 2022
A Near-Optimal Primal-Dual Method for Off-Policy Learning in CMDP
NIPS 2022
S2P: State-conditioned Image Synthesis for Data Augmentation in Offline Reinforcement Learning
NIPS 2022
Offline reinforcement learning under value and density-ratio realizability: The power of gaps
UAI 2022
<
1
…
16
17
18
…
29
>