Model diversion - PromptRiskDB

Record summary

A quick snapshot of what this page covers.

Techniques: 1 (attack methods connected to this risk)

Mitigations: 0 (defenses that may help with related attacks)

Domain: 4. Malicious Actors & Misuse (the broad risk area this belongs to)

How this risk is described and categorized.

Domain: 4. Malicious Actors & Misuse

Subdomain: 4.2 > Cyberattacks, weapon development or use, and mass harm

Entity: 1 - Human

Intent: 1 - Intentional

Timing: 2 - Post-deployment

Category: Misuse tactics to compromise GenAI systems (Model integrity)

Subcategory: Model diversion

Attack methods connected to this risk.

realized

Method: taxonomy_keyword_rule; Confidence: 55%

Defenses that may help with related attacks.

No propagated mitigations. No defense is available through the connected attack methods.

Research source for this risk, when available.

Included resource

Authors: Marchal & Xu; Year: 2024; Type: Journal Article

Original source

Open the public repository used for AI risk records and taxonomy fields.