Pass@K
A metric that evaluates whether at least one out of K generated solutions is correct.
A metric that evaluates whether at least one out of K generated solutions is correct. Used to measure the effectiveness of sampling-based approaches like Best-of-N. Pass@K is directly related to the formula 1 - (1-p)^K.