Private Training Data

Record summary

A quick snapshot of what this page covers.

Techniques1Attack methods connected to this risk.

Mitigations4Defenses that may help with related attacks.

Domain2. Privacy & SecurityThe broad risk area this belongs to.

Risk profile

How this risk is described and categorized.

"As recent LLMs continue to incorporate licensed, created, and publicly available data sources in their corpora, the potential to mix private data in the training corpora is significantly increased. The misused private data, also named as personally identifiable information (PII) [84], [86], could contain various types of sensitive data subjects, including an individual person’s name, email, phone number, address, education, and career. Generally, injecting PII into LLMs mainly occurs in two settings — the exploitation of web-collection data and the alignment with personal humanmachine conversations [87]. Specifically, the web-collection data can be crawled from online sources with sensitive PII, and the personal human-machine conversations could be collected for SFT and RLHF"

Domain2. Privacy & Security

Subdomain2.1 > Compromise of privacy by leaking or correctly inferring sensitive information

Entity1 - Human

Intent2 - Unintentional

Timing1 - Pre-deployment

CategoryPrivacy Leakage

SubcategoryPrivate Training Data

Related techniques

Attack methods connected to this risk.

AML.T0010.002 - Data

realized

Methodtaxonomy_keyword_ruleConfidence59%

Suggested mitigations

Defenses that may help with related attacks.

Source

Research source for this risk, when available.

Included resource

Risk Taxonomy, Mitigation, and Assessment Benchmarks of Large Language Model Systems

AuthorsCui et al.Year2024TypePreprint

DOI10.48550/arXiv.2401.05778 URLhttps://arxiv.org/abs/2401.05778

Original source

MIT AI Risk Repository

Open the public repository used for AI risk records and taxonomy fields.

Repositoryhttps://airisk.mit.edu/

Record summary

Risk profile

Suggested mitigations

Control Access to AI Models and Data at Rest

Sanitize Training Data

Verify AI Artifacts

Maintain AI Dataset Provenance

Source

Risk Taxonomy, Mitigation, and Assessment Benchmarks of Large Language Model Systems

MIT AI Risk Repository