Co-occurring keywords
Papers
Rating-Based Reinforcement Learning
AAAI 2024
Global Reward to Local Rewards: Multimodal-Guided Decomposition for Improving Dialogue Agents
EMNLP 2024
Aligning Factual Consistency for Clinical Studies Summarization through Reinforcement Learning
ACL 2023
Loose lips sink ships: Mitigating Length Bias in Reinforcement Learning from Human Feedback
EMNLP 2023