Hallucinations - PromptRiskDB

Record summary

A quick snapshot of what this page covers.

Techniques0Attack methods connected to this risk.

Mitigations0Defenses that may help with related attacks.

Domain3. MisinformationThe broad risk area this belongs to.

Risk profile

How this risk is described and categorized.

"The inclusion of erroneous information in the outputs from AI systems is not new. Some have cautioned against the introduction of false structures in X-ray or MRI images, and others have warned about made-up academic references. However, as ChatGPT-type tools become available to the general population, the scale of the problem may increase dramatically. Furthermore, it is compounded by the fact that these conversational AIs present true and false information with the same apparent “confidence” instead of declining to answer when they cannot ensure correctness. With less knowledgeable people, this can lead to the heightening of misinformation and potentially dangerous situations. Some have already led to court cases.'

Domain3. Misinformation

Subdomain3.1 > False or misleading information

Entity2 - AI

Intent2 - Unintentional

Timing2 - Post-deployment

CategoryHallucinations

Subcategoryn/a

Related techniques

Attack methods connected to this risk.

No linked attack methods. No AI attack method is connected to this risk in the current data.

Suggested mitigations

Defenses that may help with related attacks.

No propagated mitigations. No defense is available through the connected attack methods.

Source

Research source for this risk, when available.

Included resource

Navigating the Landscape of AI Ethics and Responsibility

AuthorsCunha & EstimaYear2023TypeConference Paper

DOI10.1007/978-3-031-49008-8_8 URLhttps://doi.org/10.1007/978-3-031-49008-8_8

Original source

MIT AI Risk Repository

Open the public repository used for AI risk records and taxonomy fields.

Repositoryhttps://airisk.mit.edu/