reinforcement learning
4122 papers
Also known as
RLVR
HARL
GRPO
RL
PPO
REINFORCE
RFT
DRL
LQR
RLHF
Papers
Smart-Searcher: Incentivizing the Dynamic Knowledge Acquisition of LLMs via Reinforcement Learning
EMNLP 2025
Breaking the Self-Evaluation Barrier: Reinforced Neuro-Symbolic Planning with Large Language Models
IJCAI 2025
GTR: Guided Thought Reinforcement Prevents Thought Collapse in RL-based VLM Agent Training
ICCV 2025
Cross-Validated Off-Policy Evaluation
AAAI 2025
Teaching Models to Improve on Tape
AAAI 2025