Human Feedback (HF)
Labels provided by humans comparing two AI outputs and indicating which one is better.
Labels provided by humans comparing two AI outputs and indicating which one is better. In RLHF (Paper 15), human feedback is used to train reward models. Constitutional AI aims to replace this with AI feedback.