Avenues for exploiting user trust and accessing more private information

Record summary

A quick snapshot of what this page covers.

Techniques: 4. Attack methods connected to this risk.

Mitigations: 7. Defenses that may help with related attacks.

Domain: 5. Human-Computer Interaction. The broad risk area this belongs to.

Risk profile

How this risk is described and categorized.

Anticipated risk: "In conversation, users may reveal private information that would otherwise be difficult to access, such as opinions or emotions. Capturing such information may enable downstream applications that violate privacy rights or cause harm to users, e.g. via more effective recommendations of addictive applications. In one study, humans who interacted with a ‘human-like’ chatbot disclosed more private information than individuals who interacted with a ‘machine-like’ chatbot [87]."

Domain: 5. Human-Computer Interaction

Subdomain: 5.1 > Overreliance and unsafe use

Entity: 3 - Other

Intent: 2 - Unintentional

Timing: 2 - Post-deployment

Category: Risk area 5: Human-Computer Interaction Harms

Subcategory: Avenues for exploiting user trust and accessing more private information

Related techniques

Attack methods connected to this risk.

Suggested mitigations

Defenses that may help with related attacks.

Source

Research source for this risk, when available.

Included resource

Taxonomy of Risks posed by Language Models

Authors: Weidinger et al.

Year: 2022

Type: Conference Paper

DOI: 10.1145/3531146.3533088

URL: https://doi.org/10.1145/3531146.3533088

Original source

MIT AI Risk Repository

Open the public repository used for AI risk records and taxonomy fields.

Repository: https://airisk.mit.edu/

Suggested mitigations

User Training

Deepfake Detection

AI Telemetry Logging

Privileged AI Agent Permissions Configuration

Single-User AI Agent Permissions Configuration

AI Agent Tools Permissions Configuration

Segmentation of AI Agent Components
