reinforcement learning
4122 papers
Also known as
RLVR
HARL
GRPO
RL
PPO
REINFORCE
RFT
DRL
RL NULL
LQR
RLHF
Co-occurring keywords
Papers
DRBO: Mitigating the Bottleneck Effect via Dynamic Reward Balancing in Multi-reward LLM Optimization
EMNLP 2025
Teaching LLMs to Plan, Not Just Solve: Plan Learning Boosts LLMs Generalization in Reasoning Tasks
EMNLP 2025
Training Medical QA Models Based on Mixed Rewards from Multiple-Choice and Open-Ended Questions
EMNLP 2025
MT-R1-Zero: Advancing LLM-based Machine Translation via R1-Zero-like Reinforcement Learning
EMNLP 2025
Teaching Models to Improve on Tape
AAAI 2025
ReAL: How Can LLMs Simulate the Real Teacher? Retrieval-enhanced Agent for Adaptive Learning
EMNLP 2025
GTA: Supervised-Guided Reinforcement Learning for Text Classification with Large Language Models
EMNLP 2025