Co-occurring keywords
Papers
R3: End-to-End Reasoning-based Planning for Multi-step Retrosynthesis via Reinforcement Learning
ACL 2026
Sparse-RL: Breaking the Memory Wall in LLM Reinforcement Learning via Stable Sparse Rollouts
ACL 2026