Direct Harm Domains (content safety harms)

Record summary

A quick snapshot of what this page covers.

Techniques: 2. Attack methods connected to this risk.

Mitigations: 2. Defenses that may help with related attacks.

Domain: 1. Discrimination & Toxicity. The broad risk area this belongs to.

How this risk is described and categorized.

Domain: 1. Discrimination & Toxicity

Subdomain: 1.2 > Exposure to toxic content

Entity: 4 - Not coded

Intent: 4 - Not coded

Timing: 4 - Not coded

Category: Direct Harm Domains (content safety harms)

Subcategory: n/a

Attack methods connected to this risk.

realized

Method: text_similarity_sqlite

Confidence: 69%

realized

Method: text_similarity_sqlite

Confidence: 61%

Defenses that may help with related attacks.

Deployment; Monitoring and Maintenance

Lifecycle: Deployment + 1 more

Category: Technical - Cyber

Business and Data Understanding; Data Preparation; + 1 more

Lifecycle: Business and Data Understanding + 2 more

Category: Technical - ML

Research source for this risk, when available.

Included resource

Authors: Gipiškis et al.

Year: 2024

Type: Journal Article
