Red Teaming
Security
Adversarial testing to find model failures
What is Red Teaming?
Simulate attacks and misuse to discover vulnerabilities and unsafe behaviors before public release.
Real-World Examples
- •Prompt-based jailbreak attempts on LLMs
Related Terms
Learn more about concepts related to Red Teaming