Compute-Optimal Strategy
The choice of inference-time strategy (Best-of-N vs. sequential revision, and how many attempts or rounds) that maximises accuracy for a given token budget. The optimal strategy depends on problem difficulty and on whether feedback from one attempt helps subsequent attempts.
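
To make the trade-off concrete, here is a minimal sketch in Python under a deliberately simple toy model. Everything in it is an assumption for illustration, not a published method: the function names, a perfect verifier that always picks a correct sample if one exists, a per-attempt success probability `p` standing in for problem difficulty, a `gain` factor standing in for how much feedback helps each revision, and a higher token cost per revision round (`revision_tokens`) than per fresh sample (`sample_tokens`).

```python
def best_of_n_accuracy(p, n):
    """P(at least one of n independent samples is correct).

    Assumes a perfect verifier selects a correct sample when one exists.
    """
    return 1.0 - (1.0 - p) ** n


def sequential_accuracy(p, rounds, gain):
    """Toy model of sequential revision.

    Assumption: each revision multiplies the per-attempt success
    probability by (1 + gain), capped at 1. gain = 0 means feedback
    from earlier attempts does not help.
    """
    fail = 1.0
    for k in range(rounds):
        p_k = min(1.0, p * (1.0 + gain) ** k)
        fail *= 1.0 - p_k
    return 1.0 - fail


def compute_optimal(p, budget, sample_tokens, revision_tokens, gain):
    """Return (strategy, attempts_or_rounds, accuracy) for a token budget."""
    n = budget // sample_tokens
    # The first draft costs sample_tokens; each later revision costs
    # revision_tokens (assumed larger, since it re-reads prior attempts).
    if budget < sample_tokens:
        rounds = 0
    else:
        rounds = 1 + (budget - sample_tokens) // revision_tokens
    options = {
        "best_of_n": (n, best_of_n_accuracy(p, n)),
        "sequential": (rounds, sequential_accuracy(p, rounds, gain)),
    }
    # On a tie, max() keeps the first entry, i.e. best_of_n.
    name = max(options, key=lambda k: options[k][1])
    count, accuracy = options[name]
    return name, count, accuracy


if __name__ == "__main__":
    # Same difficulty and budget, with and without useful feedback.
    print(compute_optimal(0.2, 4000, 500, 1000, gain=0.0))
    print(compute_optimal(0.2, 4000, 500, 1000, gain=1.0))
```

In this toy model, when feedback does not help (`gain=0.0`) the cheaper parallel samples win, and when each revision substantially improves the next attempt, sequential revision wins despite fewer rounds fitting in the budget. Real systems would need measured success rates and token costs rather than these assumed parameters.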