AI Safety

Ethics & Explainability

Designing AI systems to avoid harmful behavior

Includes robust testing, alignment research, and deployment safeguards to mitigate risks from misuse or unexpected behavior.

Learn more about concepts related to AI Safety

Model Alignment

Ensuring model goals match human intent