Papers
36 papers found
Think Right, Not More: Test-Time Scaling for Numerical Claim Verification
Primakov Chungkham, Venktesh V, Vinay Setty et al.
Thinking Before You Speak: A Proactive Test-time Scaling Approach
Cong Liu, Wenchang Chai, Hejun Wu et al.
Evaluating the Role of Verifiers in Test-Time Scaling for Legal Reasoning Tasks
Davide Romano, Jonathan Richard Schwarz, Daniele Giofrè
BoN Appetit Team at LeWiDi-2025: Best-of-N Test-time Scaling Can Not Stomach Annotation Disagreements (Yet)
Tomas Ruiz, Siyao Peng, Barbara Plank et al.
Video-T1: Test-time Scaling for Video Generation
Fangfu Liu, Hanyang Wang, Yimo Cai et al.
Visual Test-time Scaling for GUI Agent Grounding
Tiange Luo, Lajanugen Logeswaran, Justin Johnson et al.
Learning a Continue-Thinking Token for Enhanced Test-Time Scaling
Liran Ringel, Elad Tolochinsky, Yaniv Romano
Test-Time Scaling of Reasoning Models for Machine Translation
Zihao Li, Shaoxiong Ji, Jörg Tiedemann
Thinking Long, but Short: Stable Sequential Test-Time Scaling for Large Reasoning Models
Michael R. Metel, Yufei Cui, Boxing Chen et al.
From Mathematical Reasoning to Code: Generalization of Process Reward Models in Test-Time Scaling
Zhengyu Chen, Yudong Wang, Teng Xiao et al.
SCALE: Selective Resource Allocation for Overcoming Performance Bottlenecks in Mathematical Test-time Scaling
Yang Xiao, Chunpu Xu, Ruifeng Yuan et al.