Generative AI Guidelines - AI Mitigation

AI Mitigation

Guidelines are safety controls that are placed between user-provided input and a generative AI model to help direct the model to produce desired outputs and prevent undesired outputs. Guidelines can be implemented as instructions appended to all user prompts or as part of the instructions in the system prompt. They can define the goal(s), role, and voice of the system, as well as outline safety and security parame...

Overview

A source-backed snapshot of this defense.

Guidelines are safety controls that are placed between user-provided input and a generative AI model to help direct the model to produce desired outputs and prevent undesired outputs.

Guidelines can be implemented as instructions appended to all user prompts or as part of the instructions in the system prompt. They can define the goal(s), role, and voice of the system, as well as outline safety and security parameters.

Techniques7Attacks this defense is designed to help with.

Lifecycle3Where this defense applies in the AI lifecycle.

Categories1How the source groups this defense.

Safeguard details

Where this defense applies and how the source classifies it.

ATLAS ID: AML.M0021
Priority score: 35

ML Model EngineeringML Model EvaluationDeployment

Technical - ML

Covered techniques

Attacks this defense is designed to help with.

7 recordsView all techniques →

Showing 4 of 7

Source evidence

Original public records and references for this page.

View all sources →

Original source

Original source links

Open the public records and source datasets used for this page.

Repositoryhttps://github.com/mitre-atlas/atlas-data ATLAS.yamlhttps://github.com/mitre-atlas/atlas-data/blob/main/dist/ATLAS.yaml Schemahttps://github.com/mitre-atlas/atlas-data/blob/main/dist/schemas/atlas_output_schema.json

Generative AI Guidelines - AI Mitigation

Overview

Safeguard details

Covered techniques

AML.T0053 - AI Agent Tool Invocation

AML.T0062 - Discover LLM Hallucinations

AML.T0056 - Extract LLM System Prompt

AML.T0057 - LLM Data Leakage

Source evidence

Original source links