Inference-Time Scaling
The broader principle of improving model performance by allocating more compute at inference time, rather than only at training time. Test-time compute is a form of inference-time scaling.
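As a rough illustration of the principle, the sketch below uses best-of-N sampling, one common way to spend extra compute at inference time: draw several candidates and keep the one a scorer prefers. The names `best_of_n`, `toy_generate`, `toy_score`, and `TARGET` are hypothetical stand-ins, not part of any library; a real system would call a model in place of the toy generator and a verifier or reward model in place of the toy scorer.

```python
import random
from typing import Callable


def best_of_n(
    generate: Callable[[random.Random], float],
    score: Callable[[float], float],
    n: int,
    seed: int = 0,
) -> float:
    """Draw n candidates and return the one with the highest score."""
    rng = random.Random(seed)  # fixed seed so runs are repeatable
    candidates = [generate(rng) for _ in range(n)]
    return max(candidates, key=score)


# Toy stand-ins (assumptions for illustration only).
TARGET = 0.5


def toy_generate(rng: random.Random) -> float:
    """Pretend 'model': proposes a random answer in [0, 1)."""
    return rng.random()


def toy_score(answer: float) -> float:
    """Pretend 'verifier': answers closer to TARGET score higher."""
    return -abs(answer - TARGET)


# Same model, same weights; only the inference budget n changes.
small_budget = best_of_n(toy_generate, toy_score, n=1)
large_budget = best_of_n(toy_generate, toy_score, n=32)

print("n=1  score:", toy_score(small_budget))
print("n=32 score:", toy_score(large_budget))
```

Because both calls use the same seed, the 32 candidates include the single candidate from the smaller run, so the larger budget can never score worse here. In practice the gain depends on how reliable the scorer is, and each extra candidate adds inference cost.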