Social Norm - PromptRiskDB

Record summary

A quick snapshot of what this page covers.

Techniques1Attack methods connected to this risk.

Mitigations0Defenses that may help with related attacks.

Domain1. Discrimination & ToxicityThe broad risk area this belongs to.

How this risk is described and categorized.

Domain1. Discrimination & Toxicity

Subdomain1.2 > Exposure to toxic content

Entity2 - AI

Intent3 - Other

Timing2 - Post-deployment

CategorySocial Norm

Subcategoryn/a

Attack methods connected to this risk.

demonstrated

Methodtext_similarity_sqliteConfidence52%

Defenses that may help with related attacks.

No propagated mitigations. No defense is available through the connected attack methods.

Research source for this risk, when available.

Included resource

AuthorsLiu et al.Year2024TypePreprint

Original source

Open the public repository used for AI risk records and taxonomy fields.