On the Power of (Approximate) Reward Models for Inference-Time Scaling

Published in ICML 2026