PromptRiskDBThreat intelligence atlas
AI Risk

Attacks on GPAIs/GPAI Failure Modes

"This section catalogs the risk sources related to GPAI failure modes or attacks targeting GPAIs. Many of these apply mainly to LLM-based GPAIs, which share some common failure modes such as jailbreaks and trojans. These vulnerabilities often extend beyond GPAIs and fall into the broader field of adversarial machine learning. However, additional vulnerabilities may arise with the introduction of new modalities, lo...

AI RiskX.1 > Excluded4 - Not coded

Record summary

A quick snapshot of what this page covers.

Techniques2Attack methods connected to this risk.
Mitigations2Defenses that may help with related attacks.
Domainn/aThe broad risk area this belongs to.

Risk profile

How this risk is described and categorized.

"This section catalogs the risk sources related to GPAI failure modes or attacks targeting GPAIs. Many of these apply mainly to LLM-based GPAIs, which share some common failure modes such as jailbreaks and trojans. These vulnerabilities often extend beyond GPAIs and fall into the broader field of adversarial machine learning. However, additional vulnerabilities may arise with the introduction of new modalities, longer context windows, or different encodings."

Domainn/a
SubdomainX.1 > Excluded
Entity4 - Not coded
Intent4 - Not coded
Timing4 - Not coded
CategoryAttacks on GPAIs/GPAI Failure Modes
Subcategoryn/a

Suggested mitigations

Defenses that may help with related attacks.

AI Telemetry Logging

DeploymentMonitoring and Maintenance
LifecycleDeployment + 1 moreCategoryTechnical - Cyber

Source

Research source for this risk, when available.