Record summary
A quick snapshot of what this page covers.
Attack context
How this AI attack works in practice.
- ATLAS ID
- AML.T0059
- Priority score
- 96
Mitigations
Defenses that may help against this attack.
AML.M0025 - Maintain AI Dataset Provenance
Maintaining dataset provenance can help identify adverse changes to the data.
AML.M0007 - Sanitize Training Data
Remediating poisoned data can re-establish dataset integrity.
Case studies
Examples from public reports and exercises.
Web-Scale Data Poisoning: Split-View Attack
Many recent large-scale datasets are distributed as a list of URLs pointing to individual datapoints. The researchers show that many of these datasets are vulnerable to a "split-view" poisoning attack. The attack exploits the fact that the data viewed when it was initially collected may differ from the data viewed by a user during training. The researchers identify expired and buyable domains that once hosted dataset content, making it possible to replace portions of the dataset with poisoned data. They demonstrate that for 10 popular web-scale datasets, enough of the domains are purchasable to successfully carry out a poisoning attack.
Source
Where this page information comes from.
Original source
Original source links
Open the public records and source datasets used for this page.