Back-propagation (MCTS)

Appears in 1 paper

The step of Monte Carlo tree search (MCTS) that updates node statistics (visit counts, accumulated rewards) along the path from a leaf node back to the root after a rollout.

As used in Paper 24 (rStar-Math: Small LLMs Can Master Math Reasoning with Self-Evolved Deep Thinking)

After each rollout, every node on the path from the leaf back to the root has its statistics updated with the rollout's outcome.
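
The update described above can be illustrated with a minimal Python sketch. This is not code from rStar-Math. It assumes a generic single-agent tree in which each node stores a visit count and a running reward total, and the same scalar reward is added unchanged at every level. The names Node, backpropagate, visits, total_reward, and mean_value are hypothetical and chosen only for illustration.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass
class Node:
    """Hypothetical MCTS node holding only the statistics that back-propagation touches."""
    parent: Optional["Node"] = None
    visits: int = 0
    total_reward: float = 0.0

    @property
    def mean_value(self) -> float:
        # Average reward over all rollouts that passed through this node.
        return self.total_reward / self.visits if self.visits else 0.0


def backpropagate(leaf: Node, reward: float) -> None:
    """Walk from the leaf up to the root, updating every node on the path.

    Assumption: the reward is added as-is at each level (no sign flip,
    no discounting), which suits a single-agent search.
    """
    node: Optional[Node] = leaf
    while node is not None:
        node.visits += 1
        node.total_reward += reward
        node = node.parent


# Small tree: root -> child -> leaf, plus a second branch root -> sibling.
root = Node()
child = Node(parent=root)
leaf = Node(parent=child)
sibling = Node(parent=root)

backpropagate(leaf, 1.0)     # a rollout through child/leaf that succeeded
backpropagate(sibling, 0.0)  # a rollout through sibling that failed
```

After these two calls the root has been visited twice with a mean value of 0.5, while each branch has been visited once. Only nodes on the traced path change, so the two branches stay independent of each other. Two-player variants typically alternate the reward's sign at each level, which this sketch deliberately omits.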