reinforcement learning
4122 papers
Also known as
RLVR
HARL
GRPO
RL
PPO
REINFORCE
RFT
DRL
RL NULL
LQR
RLHF
Co-occurring keywords
Papers
Practical Nonisotropic Monte Carlo Sampling in High Dimensions via Determinantal Point Processes
AISTATS 2020
Value Preserving State-Action Abstractions
AISTATS 2020
Sample Complexity of Estimating the Policy Gradient for Nearly Deterministic Dynamical Systems
AISTATS 2020
Task-Completion Dialogue Policy Learning via Monte Carlo Tree Search with Dueling Network
EMNLP 2020
Dynamic Context Selection for Document-level Neural Machine Translation via Reinforcement Learning
EMNLP 2020
Multi-document Summarization with Maximal Marginal Relevance-guided Reinforcement Learning
EMNLP 2020