PromptRiskDB: Threat intelligence atlas
AI Risk

Lack of understanding of in-context learning in language models

"In-context learning allows the model to learn a new task or improve its perfor- mance by providing examples in the prompt, without changing its weights [101]. Even though this technique is highly effective, its working mechanism is not well understood. Since many potential misuses are directly related to prompting, it becomes difficult to guarantee safety when the exact mechanism of in-context learning is not ful...

Record summary

A quick snapshot of what this page covers.

Techniques: 0 (attack methods connected to this risk)
Mitigations: 0 (defenses that may help with related attacks)
Domain: 7. AI System Safety, Failures, & Limitations (the broad risk area this belongs to)

Risk profile

How this risk is described and categorized.

"In-context learning allows the model to learn a new task or improve its perfor- mance by providing examples in the prompt, without changing its weights [101]. Even though this technique is highly effective, its working mechanism is not well understood. Since many potential misuses are directly related to prompting, it becomes difficult to guarantee safety when the exact mechanism of in-context learning is not fully investigated [13]."

Domain: 7. AI System Safety, Failures, & Limitations
Subdomain: 7.4 > Lack of transparency or interpretability
Entity: 3 - Other
Intent: 3 - Other
Timing: 3 - Other
Category: Attacks on GPAIs/GPAI Failure Modes
Subcategory: Lack of understanding of in-context learning in language models

Suggested mitigations

Defenses that may help with related attacks.

No propagated mitigations. No defense is available through the connected attack methods.

Source

Research source for this risk, when available.