On the Power of (Approximate) Reward Models for Inference-Time Scaling

Published in ICML 2026