Harmful output - PromptRiskDB

Record summary

A quick snapshot of what this page covers.

Techniques: 1. Attack methods connected to this risk.

Mitigations: 0. Defenses that may help with related attacks.

Domain: 1. Discrimination & Toxicity. The broad risk area this belongs to.

How this risk is described and categorized.

Domain: 1. Discrimination & Toxicity

Subdomain: 1.2 > Exposure to toxic content

Entity: 2 - AI

Intent: 2 - Unintentional

Timing: 2 - Post-deployment

Category: Output risks (Value alignment)

Subcategory: Harmful output

Attack methods connected to this risk.

realized

Method: text_similarity_sqlite; Confidence: 65%

Defenses that may help with related attacks.

No propagated mitigations. No defense is available through the connected attack methods.

Research source for this risk, when available.

Included resource

Authors: IBM; Year: 2025; Type: Website

Original source

Open the public repository used for AI risk records and taxonomy fields.