Prompt Leaking - PromptRiskDB

Record summary

A quick snapshot of what this page covers.

Techniques6Attack methods connected to this risk.

Mitigations13Defenses that may help with related attacks.

Domain2. Privacy & SecurityThe broad risk area this belongs to.

Risk profile

How this risk is described and categorized.

Domain2. Privacy & Security

Subdomain2.1 > Compromise of privacy by leaking or correctly inferring sensitive information

Entity1 - Human

Intent1 - Intentional

Timing2 - Post-deployment

CategoryInstruction Attacks

SubcategoryPrompt Leaking

Related techniques

Attack methods connected to this risk.

AML.T0056 - Extract LLM System Prompt

feasible

Methodtext_similarity_sqliteConfidence62%

AML.T0069.002 - System Prompt

demonstrated

Methodtext_similarity_sqliteConfidence56%

AML.T0069 - Discover LLM System Information

demonstrated

Methodtext_similarity_sqliteConfidence56%

AML.T0010 - AI Supply Chain Compromise

realized

Methodtaxonomy_keyword_ruleConfidence56%

AML.T0037 - Data from Local System

realized

Methodtaxonomy_keyword_ruleConfidence56%

AML.T0086 - Exfiltration via AI Agent Tool Invocation

realized

Methodtaxonomy_keyword_ruleConfidence55%

Suggested mitigations

Defenses that may help with related attacks.

Generative AI Guardrails

ML Model EngineeringML Model Evaluation+1 more

LifecycleML Model Engineering + 2 moreCategoryTechnical - ML

Generative AI Guidelines

ML Model EngineeringML Model Evaluation+1 more

LifecycleML Model Engineering + 2 moreCategoryTechnical - ML

Generative AI Model Alignment

ML Model EngineeringML Model Evaluation+1 more

LifecycleML Model Engineering + 2 moreCategoryTechnical - ML

Verify AI Artifacts

Business and Data UnderstandingData Preparation+1 more

LifecycleBusiness and Data Understanding + 2 moreCategoryTechnical - Cyber

AI Bill of Materials

Business and Data UnderstandingData Preparation+1 more

LifecycleBusiness and Data Understanding + 2 moreCategoryPolicy

AI Telemetry Logging

DeploymentMonitoring and Maintenance

LifecycleDeployment + 1 moreCategoryTechnical - Cyber

Privileged AI Agent Permissions Configuration

Deployment

LifecycleDeploymentCategoryTechnical - Cyber

Single-User AI Agent Permissions Configuration

Deployment

LifecycleDeploymentCategoryTechnical - Cyber

AI Agent Tools Permissions Configuration

Deployment

LifecycleDeploymentCategoryTechnical - Cyber

Human In-the-Loop for AI Agent Actions

Deployment

LifecycleDeploymentCategoryTechnical - ML

Restrict AI Agent Tool Invocation on Untrusted Data

Deployment

LifecycleDeploymentCategoryTechnical - ML

Segmentation of AI Agent Components

DeploymentBusiness and Data Understanding

LifecycleDeployment + 1 moreCategoryTechnical - Cyber

Input and Output Validation for AI Agent Components

Business and Data UnderstandingData Preparation+1 more

LifecycleBusiness and Data Understanding + 2 moreCategoryTechnical - ML

Source

Research source for this risk, when available.

Included resource

Safety Assessment of Chinese Large Language Models

AuthorsSun et al.Year2023TypePreprint

DOI10.48550/arXiv.2304.10436 URLhttps://arxiv.org/abs/2304.10436

Original source

MIT AI Risk Repository

Open the public repository used for AI risk records and taxonomy fields.

Repositoryhttps://airisk.mit.edu/