Record summary
A quick snapshot of what this page covers.
Techniques5Attack methods connected to this risk.
Mitigations1Defenses that may help with related attacks.
Domain7. AI System Safety, Failures, & LimitationsThe broad risk area this belongs to.
Risk profile
How this risk is described and categorized.
Domain7. AI System Safety, Failures, & Limitations
Subdomain7.1 > AI pursuing its own goals in conflict with human goals or values
Entity2 - AI
Intent1 - Intentional
Timing3 - Other
CategoryDeception
Subcategoryn/a
Suggested mitigations
Defenses that may help with related attacks.
Code Signing
Deployment
Source
Research source for this risk, when available.
Included resource
X-Risk Analysis for AI Research
Original source
MIT AI Risk Repository
Open the public repository used for AI risk records and taxonomy fields.
