Risk area 2: Information Hazards

Record summary

A quick snapshot of what this page covers.

Techniques3Attack methods connected to this risk.

Mitigations8Defenses that may help with related attacks.

Domain2. Privacy & SecurityThe broad risk area this belongs to.

Risk profile

How this risk is described and categorized.

"LM predictions that convey true information may give rise to information hazards, whereby the dissemination of private or sensitive information can cause harm [27]. Information hazards can cause harm at the point of use, even with no mistake of the technology user. For example, revealing trade secrets can damage a business, revealing a health diagnosis can cause emotional distress, and revealing private data can violate a person’s rights. Information hazards arise from the LM providing private data or sensitive information that is present in, or can be inferred from, training data. Observed risks include privacy violations [34]. Mitigation strategies include algorithmic solutions and responsible model release strategies."

Domain2. Privacy & Security

Subdomain2.1 > Compromise of privacy by leaking or correctly inferring sensitive information

Entity2 - AI

Intent2 - Unintentional

Timing2 - Post-deployment

CategoryRisk area 2: Information Hazards

Subcategoryn/a

Related techniques

Attack methods connected to this risk.

Suggested mitigations

Defenses that may help with related attacks.

Source

Research source for this risk, when available.

Included resource

Taxonomy of Risks posed by Language Models

AuthorsWeidinger et al.Year2022TypeConference Paper

DOI10.1145/3531146.3533088 URLhttps://doi.org/10.1145/3531146.3533088

Original source

MIT AI Risk Repository

Open the public repository used for AI risk records and taxonomy fields.

Repositoryhttps://airisk.mit.edu/

Risk area 2: Information Hazards

Record summary

Risk profile

Suggested mitigations

Control Access to AI Models and Data at Rest

Encrypt Sensitive Information

AI Model Distribution Methods

Restrict Library Loading

Code Signing

Vulnerability Scanning

User Training

AI Bill of Materials

Source

Taxonomy of Risks posed by Language Models

MIT AI Risk Repository