RLAIF (Reinforcement Learning from AI Feedback)

Appears in 1 paper

The stage of Constitutional AI where an AI (rather than a human) provides feedback on which response better follows the constitution.

As used in Paper 22 — Constitutional AI: Harmlessness from AI Feedback →

The stage of Constitutional AI where an AI (rather than a human) provides feedback on which response better follows the constitution. The feedback is used to train a reward model, which is then used for RL optimization.

Paper 22 — Constitutional AI: Harmlessness from AI Feedback →

Appears in papers