Record summary
A quick snapshot of what this page covers.
Risk profile
How this risk is described and categorized.
"These harms relate to violations of an individual’s or group’s moral or legal right to privacy. Such harms may be exacerbated by assistants that influence users to disclose personal information or private information that pertains to others. Resultant harms might include identity theft, or stigmatisation and discrimination based on individual or group characteristics. This could have a detrimental impact, particularly on marginalised communities. Furthermore, in principle, state-owned AI assistants could employ manipulation or deception to extract private information for surveillance purposes."
Suggested mitigations
Defenses that may help with related attacks.
Restrict Number of AI Model Queries
Control Access to AI Models and Data in Production
AI Telemetry Logging
Verify AI Artifacts
Generative AI Guardrails
AI Bill of Materials
Restrict Library Loading
Code Signing
Vulnerability Scanning
User Training
Source
Research source for this risk, when available.
Included resource
The Ethics of Advanced AI Assistants
Original source
MIT AI Risk Repository
Open the public repository used for AI risk records and taxonomy fields.
