Runji Lin
8 papers · 2022–2025 · 5 conferences · across top CS/AI conferences
Achievements
Interdisciplinary Bridge
Hot Topic Early Bird
Conference Polyglot (5)
Cross-Pollinator (14)
Renaissance Researcher (5)
Taxonomy Completionist (19)
Keyword Pioneer
Conferences
ACL (3)
NIPS (2)
ICLR (1)
ICML (1)
NAACL (1)
Keywords
reinforcement learning (3)
reward model (2)
mathematical reasoning (2)
process reward model (2)
offline reinforcement learning (1)
monte carlo estimation (1)
on-policy learning (1)
sequence model (1)
inference efficiency (1)
expert routing (1)
encoder-decoder architecture (1)
text-based game (1)
critique model (1)
chain of thought (1)
critic model (1)
error identification (1)
large language model (1)
llm ensemble (1)
real-time strategy (1)
multi-agent system (1)
Papers
LLM Critics Help Catch Bugs in Mathematics: Towards a Better Mathematical Verifier with Natural Language Feedback
ACL 2025
ProcessBench: Identifying Process Errors in Mathematical Reasoning
ACL 2025
MARGE: Improving Math Reasoning with Guided Exploration
ICML 2025
The Lessons of Developing Process Reward Models in Mathematical Reasoning
ACL 2025
Routing to the Expert: Efficient Reward-guided Ensemble of Large Language Models
NAACL 2024
Large Language Models Play StarCraft II: Benchmarks and A Chain of Summarization Approach
NIPS 2024
#InsTag: Instruction Tagging for Analyzing Supervised Fine-tuning of Large Language Models
ICLR 2024
Multi-Agent Reinforcement Learning is a Sequence Modeling Problem
NIPS 2022