reinforcement learning
4122 papers
Also known as
RLVR
HARL
GRPO
RL
PPO
REINFORCE
RFT
DRL
RL NULL
LQR
RLHF
Co-occurring keywords
Papers
Flexible Thinking for Multimodal Emotional Support Conversation via Reinforcement Learning
EMNLP 2025
PRED: Performance-oriented Random Early Detection for Consistently Stable Performance in Datacenters
NSDI 2025
KERLQA: Knowledge-Enhanced Reinforcement Learning for Question Answering in Low-resource Languages
IJCNLP 2025
DRBO: Mitigating the Bottleneck Effect via Dynamic Reward Balancing in Multi-reward LLM Optimization
EMNLP 2025
MT-R1-Zero: Advancing LLM-based Machine Translation via R1-Zero-like Reinforcement Learning
EMNLP 2025