Restrict AI Agent Tool Invocation on Untrusted Data - AI Mitigation

Untrusted data can contain prompt injections that invoke an AI agent's tools, potentially causing confidentiality, integrity, or availability violations. Restrict or limit tool invocation once untrusted data enters the LLM's context.

Overview

The degree to which tool invocation is restricted may depend on the potential consequences of the action. Consider blocking the automatic invocation of tools, or requiring user confirmation, once untrusted data enters the LLM's context. For high-consequence actions, consider always requiring user confirmation.
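
The policy described above can be sketched as a small gate placed in front of tool calls. This is a minimal illustration under stated assumptions, not part of the ATLAS record: the names `AgentContext`, `Consequence`, `decide`, and `invoke_tool` are hypothetical, the two-level consequence scale is a simplification, and the `confirm` callback stands in for whatever user-confirmation mechanism a real agent would use.

```python
from dataclasses import dataclass
from enum import Enum
from typing import Any, Callable


class Consequence(Enum):
    """Hypothetical two-level rating of what a tool call can affect."""
    LOW = "low"
    HIGH = "high"


class Decision(Enum):
    ALLOW = "allow"      # the tool may run automatically
    CONFIRM = "confirm"  # the user must approve before the tool runs


@dataclass
class AgentContext:
    """Tracks whether untrusted data has entered the LLM's context."""
    tainted: bool = False

    def add(self, text: str, trusted: bool) -> None:
        # Taint is sticky (an assumption of this sketch): once untrusted
        # data is in the context, the context stays tainted.
        if not trusted:
            self.tainted = True


def decide(ctx: AgentContext, consequence: Consequence) -> Decision:
    """Apply the policy: high-consequence actions always need confirmation;
    other actions need it only after untrusted data has entered the context."""
    if consequence is Consequence.HIGH:
        return Decision.CONFIRM
    if ctx.tainted:
        return Decision.CONFIRM
    return Decision.ALLOW


def invoke_tool(
    ctx: AgentContext,
    name: str,
    tool: Callable[[], Any],
    consequence: Consequence,
    confirm: Callable[[str], bool],
) -> Any:
    """Run `tool` only if the policy allows it or the user approves it."""
    if decide(ctx, consequence) is Decision.CONFIRM and not confirm(name):
        raise PermissionError(f"tool call '{name}' was not confirmed by the user")
    return tool()
```

In this sketch, a caller would mark retrieved web pages, emails, or file contents as untrusted via `ctx.add(..., trusted=False)` before the model sees them. A stricter variant could refuse tool calls outright in a tainted context instead of asking for confirmation, which corresponds to the "blocking the automatic invocation" option above.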

Techniques: 3. Attacks this defense is designed to help with.

Lifecycle: 1. Where this defense applies in the AI lifecycle.

Categories: 1. How the source groups this defense.

Safeguard details

Where this defense applies and how the source classifies it.

ATLAS ID: AML.M0030
Priority score: 15

Deployment

Technical - ML

Covered techniques

Attacks this defense is designed to help with.

3 records

Source evidence

Original public records and references for this page.

Original source links

Open the public records and source datasets used for this page.

Repository: https://github.com/mitre-atlas/atlas-data
ATLAS.yaml: https://github.com/mitre-atlas/atlas-data/blob/main/dist/ATLAS.yaml
Schema: https://github.com/mitre-atlas/atlas-data/blob/main/dist/schemas/atlas_output_schema.json