conftrace_

Runji Lin

8 papers · 2022–2025 · 5 conferences · across top CS/AI conferences

Achievements

Jump to papers ↓

+2 more ↓

🌉 Interdisciplinary Bridge 🐣 Hot Topic Early Bird 🌍 Conference Polyglot (5) 🐝 Cross-Pollinator (14) 🌈 Renaissance Researcher (5)

🗺️ Taxonomy Completionist (19) 🧭 Keyword Pioneer

Conferences

ACL (3) NIPS (2) ICLR (1) ICML (1) NAACL (1)

Top co-authors

Junyang Lin (5) Keming Lu (5) Jingren Zhou (4) Chang Zhou (3) Bowen Yu (3) Dayiheng Liu (3) Zheng Yuan (2) Beichen Zhang (2) Hongyi Yuan (2) Jun Wang (2)

Keywords

reinforcement learning (3) reward model (2) mathematical reasoning (2) process reward model (2) offline reinforcement learning (1) monte carlo estimation (1) on-policy learning (1) sequence model (1) inference efficiency (1) expert routing (1) encoder-decoder architecture (1) text-based game (1) critique model (1) chain of thought (1) critic model (1) error identification (1) large language model (1) llm ensemble (1) real-time strategy (1) multi-agent system (1)

Papers

LLM Critics Help Catch Bugs in Mathematics: Towards a Better Mathematical Verifier with Natural Language Feedback ACL 2025 ProcessBench: Identifying Process Errors in Mathematical Reasoning ACL 2025 MARGE: Improving Math Reasoning with Guided Exploration ICML 2025 The Lessons of Developing Process Reward Models in Mathematical Reasoning ACL 2025 Routing to the Expert: Efficient Reward-guided Ensemble of Large Language Models NAACL 2024 Large Language Models Play StarCraft II:Benchmarks and A Chain of Summarization Approach NIPS 2024 #InsTag: Instruction Tagging for Analyzing Supervised Fine-tuning of Large Language Models ICLR 2024 Multi-Agent Reinforcement Learning is a Sequence Modeling Problem NIPS 2022