Injustice - PromptRiskDB

Record summary

A quick snapshot of what this page covers.

Techniques: 1 (attack methods connected to this risk)

Mitigations: 0 (defenses that may help with related attacks)

Domain: 1. Discrimination & Toxicity (the broad risk area this belongs to)

How this risk is described and categorized.

Domain: 1. Discrimination & Toxicity

Subdomain: 1.1 > Unfair discrimination and misrepresentation

Entity: 2 - AI

Intent: 2 - Unintentional

Timing: 2 - Post-deployment

Category: Fairness

Subcategory: Injustice

Attack methods connected to this risk.

demonstrated

Method: text_similarity_sqlite; Confidence: 55%

Defenses that may help with related attacks.

No propagated mitigations: none of the connected attack methods has an associated defense.

Research source for this risk, when available.

Included resource

Authors: Liu et al.; Year: 2024; Type: Preprint

Original source

Open the public repository used for AI risk records and taxonomy fields.