Constitutional AI (CAI)
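An alignment method that replaces human feedback on harmlessness with AI feedback guided by a written set of principles (the "constitution"). It has two stages: SL-CAI, a supervised learning stage in which the model critiques and revises its own responses and is then fine-tuned on the revisions, and RL-CAI, a reinforcement learning stage in which AI-generated preference labels train a preference model that supplies the reward signal.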
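
The data-generation side of the two stages can be sketched roughly as follows. This is a minimal illustration, not a reference implementation: the `Model` type, the function names, and the prompt wording are all hypothetical, and the fine-tuning and reinforcement learning steps themselves are left out.

```python
from typing import Callable, List, Tuple

# Hypothetical stand-in for a language model: maps a prompt string to a completion.
Model = Callable[[str], str]


def critique_and_revise(model: Model, prompt: str, principles: List[str]) -> str:
    """SL-CAI data step (sketch): draft a response, then critique and revise it
    once per principle. The final revision would be used as a fine-tuning target."""
    response = model(prompt)
    for principle in principles:
        critique = model(
            f"Prompt: {prompt}\nResponse: {response}\n"
            f"Critique the response using this principle: {principle}"
        )
        response = model(
            f"Prompt: {prompt}\nResponse: {response}\nCritique: {critique}\n"
            "Rewrite the response to address the critique."
        )
    return response


def ai_preference(
    model: Model, prompt: str, a: str, b: str, principle: str
) -> Tuple[str, str]:
    """RL-CAI data step (sketch): ask the model which of two responses better
    follows a principle. Returns (chosen, rejected), the kind of pair a
    preference model would be trained on."""
    verdict = model(
        f"Prompt: {prompt}\n(A) {a}\n(B) {b}\n"
        f"Which response better follows this principle: {principle}? Answer A or B."
    )
    # Assumption: any verdict that does not start with "A" is treated as "B".
    if verdict.strip().upper().startswith("A"):
        return a, b
    return b, a
```

Any callable that takes a string and returns a string can be passed as `model`, which makes it easy to try the control flow with a stub before wiring in a real model.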