PromptRiskDB: Threat intelligence atlas
AI Risk

Nascent capabilities (emergent capabilities)

Record summary

A quick snapshot of what this page covers.

Techniques: 0. Attack methods connected to this risk.
Mitigations: 0. Defenses that may help with related attacks.
Domain: n/a. The broad risk area this belongs to.

Risk profile

How this risk is described and categorized.

Example: "Deception: Park et al. have established that generative AI models may pursue their goals via deception. Another study by Pan et al. highlighted unethical behaviors. For instance, during a pre-release experiment, the GPT-4 model feigned being a visually impaired human to coax an online worker into solving a CAPTCHA (a puzzle used by many websites to weed out automated responses from those of individual humans). When prompted to explain its reasoning, the model said: “I should not reveal that I am a robot. I should invent an excuse for why I cannot solve CAPTCHAs.”"

Domain: n/a
Subdomain: n/a
Entity: n/a
Intent: n/a
Timing: n/a
Category: Ethical and social risks
Subcategory: Nascent capabilities (emergent capabilities)

Suggested mitigations

Defenses that may help with related attacks.

No propagated mitigations. No defense is available through the connected attack methods.

Source

Research source for this risk, when available.