Model Alignment
Ethics & Explainability
Ensuring model goals match human intent
What is Model Alignment?
Efforts to make AI objectives and behavior align with human values and safety constraints.
Real-World Examples
- •Avoiding harmful outputs in chatbots
Related Terms
Learn more about concepts related to Model Alignment