reinforcement learning
4122 papers
Also known as
RLVR
HARL
GRPO
RL
PPO
REINFORCE
RFT
DRL
LQR
RLHF
Papers
Conversational Question Answering with Language Models Generated Reformulations over Knowledge Graph
ACL 2024
JoTR: A Joint Transformer and Reinforcement Learning Framework for Dialogue Policy Learning
COLING 2024
DDxGym: Online Transformer Policies in a Knowledge Graph Based Natural Language Environment
COLING 2024