Attacks on GPAIs/GPAI Failure Modes

Record summary

A quick snapshot of what this page covers.

Techniques2Attack methods connected to this risk.

Mitigations2Defenses that may help with related attacks.

Domainn/aThe broad risk area this belongs to.

Risk profile

How this risk is described and categorized.

"This section catalogs the risk sources related to GPAI failure modes or attacks targeting GPAIs. Many of these apply mainly to LLM-based GPAIs, which share some common failure modes such as jailbreaks and trojans. These vulnerabilities often extend beyond GPAIs and fall into the broader field of adversarial machine learning. However, additional vulnerabilities may arise with the introduction of new modalities, longer context windows, or different encodings."

Domainn/a

SubdomainX.1 > Excluded

Entity4 - Not coded

Intent4 - Not coded

Timing4 - Not coded

CategoryAttacks on GPAIs/GPAI Failure Modes

Subcategoryn/a