reinforcement learning
4122 papers
Also known as
RLVR
HARL
GRPO
RL
PPO
REINFORCE
RFT
DRL
RL NULL
LQR
RLHF
Co-occurring keywords
Papers
ToolPlanner: A Tool Augmented LLM for Multi Granularity Instructions with Path Planning and Feedback
EMNLP 2024
Weak Reward Model Transforms Generative Models into Robust Causal Event Extraction Systems
EMNLP 2024
Excluding the Irrelevant: Focusing Reinforcement Learning through Continuous Action Masking
NIPS 2024
Learning to Trust Your Feelings: Leveraging Self-awareness in LLMs for Hallucination Mitigation
ACL 2024