offline reinforcement learning
492 papers
Also known as
MORL
O2O
CQL-PRIOR
OSRL
ORDC
OFFLINE RL
RVS
OMRL
OOD
ORL
RL
Co-occurring keywords
Papers
In-Dataset Trajectory Return Regularization for Offline Preference-based Reinforcement Learning
AAAI 2025
Unearthing Gems from Stones: Policy Optimization with Negative Sample Augmentation for LLM Reasoning
EMNLP 2025