Papers
Smart-Searcher: Incentivizing the Dynamic Knowledge Acquisition of LLMs via Reinforcement Learning
EMNLP 2025
Breaking the Self-Evaluation Barrier: Reinforced Neuro-Symbolic Planning with Large Language Models
IJCAI 2025
GTR: Guided Thought Reinforcement Prevents Thought Collapse in RL-based VLM Agent Training
ICCV 2025
Cross-Validated Off-Policy Evaluation
AAAI 2025
Teaching Models to Improve on Tape
AAAI 2025