reinforcement learning
4122 papers
Also known as
RLVR
HARL
GRPO
RL
PPO
REINFORCE
RFT
DRL
LQR
RLHF
Papers
Designs for Enabling Collaboration in Human-Machine Teaming via Interactive and Explainable Systems
NIPS 2024
Expectation Alignment: Handling Reward Misspecification in the Presence of Expectation Mismatch
NIPS 2024
Isometric Neural Machine Translation using Phoneme Count Ratio Reward-based Reinforcement Learning
NAACL 2024