Nascent capabilities (emergent capabilities)

Record summary

A quick snapshot of what this page covers.

Techniques0Attack methods connected to this risk.

Mitigations0Defenses that may help with related attacks.

Domainn/aThe broad risk area this belongs to.

Risk profile

How this risk is described and categorized.

Example: "Deception: Park et al. have established that generative AI models may pursue their goals via deception. Another study by Pan et al. highlighted unethical behaviors.431 For instance, during a pre-release experiment, the GPT-4 model feigned being a visually impaired human to coax an online worker into solving a CAPTCHA (a puzzle used by many websites to weed out automated responses from those of individual humans). When prompted to explain its reasoning, the model said: “I should not reveal that I am a robot. I should invent an excuse for why I cannot solve CAPTCHAs.”

Domainn/a

Subdomainn/a

Entityn/a

Intentn/a

Timingn/a

CategoryEthical and social risks

SubcategoryNascent capabilities (emergent capabilities)

Related techniques

Attack methods connected to this risk.

No linked attack methods. No AI attack method is connected to this risk in the current data.

Suggested mitigations

Defenses that may help with related attacks.

No propagated mitigations. No defense is available through the connected attack methods.

Source

Research source for this risk, when available.

Included resource

Regulating under Uncertainty: Governance Options for Generative AI

AuthorsG'sellYear2024TypeReport

DOI10.2139/ssrn.4918704 URLhttps://papers.ssrn.com/sol3/papers.cfm?abstract_id=4918704

Original source

MIT AI Risk Repository

Open the public repository used for AI risk records and taxonomy fields.

Repositoryhttps://airisk.mit.edu/