Jailbreaking - PromptRiskDB

Record summary

A quick snapshot of what this page covers.

Techniques: 7. Attack methods connected to this risk.

Mitigations: 5. Defenses that may help with related attacks.

Domain: 2. Privacy & Security. The broad risk area this belongs to.

Risk profile

How this risk is described and categorized.

Domain: 2. Privacy & Security

Subdomain: 2.2 > AI system security vulnerabilities and attacks

Entity: 1 - Human

Intent: 1 - Intentional

Timing: 2 - Post-deployment

Category: Inference risks (Multi-category)

Subcategory: Jailbreaking

Related techniques

Attack methods connected to this risk.

AML.T0040 - AI Model Inference API Access

realized

Method: taxonomy_keyword_rule; Confidence: 72%

AML.T0054 - LLM Jailbreak

demonstrated

Method: taxonomy_keyword_rule; Confidence: 70%

AML.T0016.002 - Generative AI

realized

Method: taxonomy_keyword_rule; Confidence: 68%

AML.T0061 - LLM Prompt Self-Replication

demonstrated

Method: taxonomy_keyword_rule; Confidence: 65%

AML.T0008.005 - AI Service Proxies

realized

Method: taxonomy_keyword_rule; Confidence: 55%

AML.T0111 - AI Supply Chain Reputation Inflation

demonstrated

Method: taxonomy_keyword_rule; Confidence: 55%

AML.T0084.001 - Tool Definitions

demonstrated

Method: taxonomy_keyword_rule; Confidence: 55%

Suggested mitigations

Defenses that may help with related attacks.

Control Access to AI Models and Data in Production

Deployment, Monitoring and Maintenance

Lifecycle: Deployment + 1 more; Category: Policy

AI Telemetry Logging

Deployment, Monitoring and Maintenance

Lifecycle: Deployment + 1 more; Category: Technical - Cyber

Generative AI Guardrails

ML Model Engineering, ML Model Evaluation, + 1 more

Lifecycle: ML Model Engineering + 2 more; Category: Technical - ML

Generative AI Guidelines

ML Model Engineering, ML Model Evaluation, + 1 more

Lifecycle: ML Model Engineering + 2 more; Category: Technical - ML

Generative AI Model Alignment

ML Model Engineering, ML Model Evaluation, + 1 more

Lifecycle: ML Model Engineering + 2 more; Category: Technical - ML

Source

Research source for this risk, when available.

Included resource

AI Risk Atlas

Authors: IBM; Year: 2025; Type: Website

URL: https://www.ibm.com/docs/en/watsonx/saas?topic=ai-risk-atlas

Original source

MIT AI Risk Repository

Open the public repository used for AI risk records and taxonomy fields.

Repository: https://airisk.mit.edu/