APromptRiskDBThreat intelligence atlas
AI Risk

Discrimination, Exclusion and Toxicity

"Social harms that arise from the language model producing discriminatory or exclusionary speech"

AI Risk1. Discrimination & Toxicity1.0 > Discrimination & Toxicity2 - Post-deployment

Record summary

A quick snapshot of what this page covers.

Techniques0Attack methods connected to this risk.
Mitigations0Defenses that may help with related attacks.
Domain1. Discrimination & ToxicityThe broad risk area this belongs to.

Risk profile

How this risk is described and categorized.

Domain1. Discrimination & Toxicity
Subdomain1.0 > Discrimination & Toxicity
Entity2 - AI
Intent2 - Unintentional
Timing2 - Post-deployment
CategoryDiscrimination, Exclusion and Toxicity
Subcategoryn/a

Suggested mitigations

Defenses that may help with related attacks.

No propagated mitigations. No defense is available through the connected attack methods.

Source

Research source for this risk, when available.