Rejection Sampling

Appears in 1 paper

A data generation strategy: generate many candidate solutions, keep only the correct ones, discard the rest.

As used in Paper 24 — rStar-Math: Small LLMs Can Master Math Reasoning with Self-Evolved Deep Thinking →

A data generation strategy: generate many candidate solutions, keep only the correct ones, discard the rest. Less efficient than MCTS but simpler to implement.