RLHF

AI/ML Fundamentals

Reinforcement Learning from Human Feedback

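Combines supervised learning and reinforcement learning (RL) to fine-tune a model: a reward model is trained on human preference judgments, and the model is then optimized against that learned reward to produce safer, better-aligned behavior.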

Concepts related to RLHF

Reward Modeling

Learning a reward function from human feedback
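
As a rough illustration of learning a reward function from feedback, the sketch below fits a linear reward model to pairwise preferences using a Bradley-Terry style loss, -log sigmoid(r(chosen) - r(rejected)). This is one common formulation, not necessarily the one any particular system uses. The two-feature response vectors, the PAIRS data, and all function names are hypothetical and exist only for this example.

```python
import math

def sigmoid(x: float) -> float:
    return 1.0 / (1.0 + math.exp(-x))

def reward(w, x):
    # Linear reward model: dot product of weights and response features.
    return sum(wi * xi for wi, xi in zip(w, x))

def pairwise_loss(w, chosen, rejected):
    # Bradley-Terry style loss: small when the chosen response outscores the rejected one.
    return -math.log(sigmoid(reward(w, chosen) - reward(w, rejected)))

def train(pairs, dim, lr=0.5, epochs=200):
    w = [0.0] * dim
    for _ in range(epochs):
        for chosen, rejected in pairs:
            margin = reward(w, chosen) - reward(w, rejected)
            # Gradient of the loss w.r.t. w is -(1 - sigmoid(margin)) * (chosen - rejected),
            # so a gradient-descent step adds that term scaled by the learning rate.
            scale = 1.0 - sigmoid(margin)
            w = [wi + lr * scale * (c - r) for wi, c, r in zip(w, chosen, rejected)]
    return w

# Hypothetical toy data: each response is [helpfulness, rudeness];
# the first item of each pair is the one a human labeler preferred.
PAIRS = [
    ([1.0, 0.0], [0.0, 1.0]),
    ([0.8, 0.1], [0.3, 0.9]),
    ([0.9, 0.2], [0.4, 0.4]),
]

weights = train(PAIRS, dim=2)
```

After training, the learned weights score every preferred response above its rejected counterpart. In a real RLHF setup the linear model would typically be replaced by a neural network over text, but the preference loss has the same shape.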

Reinforcement Learning

Learning through trial and error with rewards/penalties
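
Trial-and-error learning can be sketched with a small epsilon-greedy bandit, shown below as a minimal example under stated assumptions rather than a full RL algorithm. The agent repeatedly picks an arm, receives a reward of +1 or a penalty of -1, and updates a running estimate of each arm's value. The win probabilities, step count, epsilon, and the name run_bandit are all hypothetical choices for illustration.

```python
import random

def run_bandit(win_probs, steps=5000, epsilon=0.1, seed=0):
    rng = random.Random(seed)      # seeded for repeatable runs
    n_arms = len(win_probs)
    counts = [0] * n_arms          # how often each arm was tried
    values = [0.0] * n_arms        # running average reward per arm
    for _ in range(steps):
        if rng.random() < epsilon:
            # Explore: try a random arm.
            arm = rng.randrange(n_arms)
        else:
            # Exploit: pick the arm with the best estimate so far.
            arm = max(range(n_arms), key=lambda a: values[a])
        # Reward of +1 on a win, penalty of -1 otherwise.
        r = 1.0 if rng.random() < win_probs[arm] else -1.0
        counts[arm] += 1
        # Incremental mean update of the arm's estimated value.
        values[arm] += (r - values[arm]) / counts[arm]
    return values, counts

values, counts = run_bandit([0.2, 0.5, 0.8])
best_arm = max(range(len(values)), key=lambda a: values[a])
```

With these settings the value estimates should settle so that the arm with the highest win probability looks best. The epsilon parameter controls how much the agent keeps exploring instead of always exploiting its current estimate.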