Entropy Regularization
A term in RL that encourages exploration by rewarding policy entropy (randomness).
A term in RL that encourages exploration by rewarding policy entropy (randomness). High entropy = exploring many actions; low entropy = committing to a few. RLHF uses entropy implicitly through the KL penalty, which discourages collapsing to a deterministic policy.