AI Safety
Ethics & Explainability
Designing AI systems to avoid harmful behavior
What is AI Safety?
AI safety includes robust testing, alignment research, and deployment safeguards that mitigate risks from misuse or unexpected behavior.
Real-World Examples
- Red-teaming models
- Safety layers in deployment