Lack of capability for task

Record summary

A quick snapshot of what this page covers.

Techniques1Attack methods connected to this risk.

Mitigations5Defenses that may help with related attacks.

Domain7. AI System Safety, Failures, & LimitationsThe broad risk area this belongs to.

Risk profile

How this risk is described and categorized.

"As we have seen, this could be due to the skill not being required during the training process (perhaps due to issues with the training data) or because the learnt skill was quite brittle and was not generalisable to a new situation (lack of robustness to distributional shift). In particular, advanced AI assistants may not have the capability to represent complex concepts that are pertinent to their own ethical impact, for example the concept of 'benefitting the user' or 'when the user asks' or representing 'the way in which a user expects to be benefitted'."

Domain7. AI System Safety, Failures, & Limitations

Subdomain7.3 > Lack of capability or robustness

Entity2 - AI

Intent2 - Unintentional

Timing1 - Pre-deployment

CategoryCapability failures

SubcategoryLack of capability for task

Related techniques

Attack methods connected to this risk.

AML.T0011.001 - Malicious Package

realized

Methodtext_similarity_sqliteConfidence54%

Suggested mitigations

Defenses that may help with related attacks.

Source

Research source for this risk, when available.

Included resource

The Ethics of Advanced AI Assistants

AuthorsGabriel et al.Year2024TypePreprint

DOI10.48550/arXiv.2404.16244 URLhttps://doi.org/10.48550/arXiv.2404.16244

Original source

MIT AI Risk Repository

Open the public repository used for AI risk records and taxonomy fields.

Repositoryhttps://airisk.mit.edu/

Lack of capability for task

Record summary

Risk profile

Suggested mitigations

Restrict Library Loading

Code Signing

Vulnerability Scanning

User Training

AI Bill of Materials

Source

The Ethics of Advanced AI Assistants

MIT AI Risk Repository