ToolPRM: Fine-Grained Inference Scaling of Structured Outputs for Function Calling

Jianghao Lin; Yuanyuan Shi; Xin Peng; Renjie Ding; Hairui Wang; Yuxuan Peng; Bizhe Bai; Weixi Song; Fengshuo Bai; Huacan Chai; Weinan Zhang; Fei Huang; Ying Wen

ACL 2026

Abstract

Large language models (LLMs) excel at function calling, but inference scaling has been explored mainly for unstructured generation. We propose an inference-scaling framework for structured outputs that combines fine-grained beam search with ToolPRM, a process reward model that scores each intra-call decision (function name selection and argument filling). We build the first fine-grained intra-call supervision dataset via function masking, rollout collection, and step-level annotation. ToolPRM outperforms outcome and coarse-grained reward models in predictive accuracy and yields consistent test-time gains on multiple function-calling benchmarks. We further show that structured generation follows an “explore more but retain less” principle, since early JSON errors are unrecoverable.
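As a rough illustration of the idea of step-level beam search over a function call, the sketch below expands each partial call by one intra-call decision at a time and keeps only the top-scoring partial calls. It is a minimal sketch under stated assumptions, not the paper's implementation: the names `fine_grained_beam_search`, `propose`, `score`, `expand_width`, and `keep_width`, along with the toy proposer and scorer, are all hypothetical. In the framework described above, the scorer would be a learned process reward model and the proposer would be an LLM.

```python
from typing import Callable, List, Tuple

Step = str                   # one intra-call decision, e.g. "name=..." or "arg=value"
Partial = Tuple[Step, ...]   # a partially built function call


def fine_grained_beam_search(
    propose: Callable[[Partial], List[Step]],  # hypothetical candidate generator
    score: Callable[[Partial], float],         # hypothetical step-level scorer
    num_steps: int,
    expand_width: int,  # candidates explored per beam ("explore more")
    keep_width: int,    # partial calls retained per step ("retain less")
) -> List[Tuple[float, Partial]]:
    """Grow partial calls one decision at a time, pruning after every step."""
    beams: List[Tuple[float, Partial]] = [(0.0, ())]
    for _ in range(num_steps):
        candidates: List[Tuple[float, Partial]] = []
        for _, partial in beams:
            for step in propose(partial)[:expand_width]:
                extended = partial + (step,)
                candidates.append((score(extended), extended))
        if not candidates:
            break  # nothing left to expand
        candidates.sort(key=lambda c: c[0], reverse=True)
        beams = candidates[:keep_width]
    return beams


# Toy stand-ins (assumptions for illustration only).
TARGET = ("name=get_weather", "city=Paris", "unit=celsius")
OPTIONS = [
    ["name=get_time", "name=get_weather"],
    ["city=Paris", "city=London"],
    ["unit=celsius", "unit=kelvin"],
]


def toy_propose(partial: Partial) -> List[Step]:
    # Offer the fixed options for the next decision, or nothing once complete.
    return OPTIONS[len(partial)] if len(partial) < len(OPTIONS) else []


def toy_score(partial: Partial) -> float:
    # Fraction of decisions so far that match the reference call.
    return sum(a == b for a, b in zip(partial, TARGET)) / len(partial)


best = fine_grained_beam_search(
    toy_propose, toy_score, num_steps=3, expand_width=2, keep_width=1
)
print(best[0])
```

Setting `expand_width` larger than `keep_width` mirrors the "explore more but retain less" observation: many candidate decisions are scored at each step, but only a few partial calls survive, so an early wrong function name or argument is pruned before it can propagate.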

Topics

Artificial Intelligence > Core AI > Large Language Models; Artificial Intelligence > Core AI > Reasoning; Deep Learning > Learning Types > Code Generation

Keywords

structured output generation; beam search; function calling; process reward model; inference-time scaling
