Social stereotypes and unfair discrimination

Record summary

A quick snapshot of what this page covers.

Techniques1Attack methods connected to this risk.

Mitigations0Defenses that may help with related attacks.

Domain1. Discrimination & ToxicityThe broad risk area this belongs to.

How this risk is described and categorized.

Domain1. Discrimination & Toxicity

Subdomain1.1 > Unfair discrimination and misrepresentation

Entity2 - AI

Intent2 - Unintentional

Timing3 - Other

CategoryRisk area 1: Discrimination, Hate speech and Exclusion

SubcategorySocial stereotypes and unfair discrimination

Attack methods connected to this risk.

demonstrated

Methodtext_similarity_sqliteConfidence55%

Defenses that may help with related attacks.

No propagated mitigations. No defense is available through the connected attack methods.

Research source for this risk, when available.

Included resource

AuthorsWeidinger et al.Year2022TypeConference Paper

Original source

Open the public repository used for AI risk records and taxonomy fields.