Attack on Machine Translation Services - AI Case Study

Overview

Case steps9Steps described in the case record.

Techniques9Attack methods mentioned in the case steps.

Linked CVEs0Known vulnerabilities mentioned in the record.

Risk patterns

Patterns found in the case record and its linked vulnerabilities.

1Dominant ATLAS tactic. Impact appears in 3 case steps.
2Multiple attack methods. The case connects to 9 unique AI attack methods.

Procedure timeline

Search the case steps or filter them by attacker goal.

Impact3Resource Development2AI Attack Staging2Reconnaissance1AI Model Access1

Step 1
Search Open Technical Databases
Reconnaissance

The researchers used published research papers to identify the datasets and model architectures used by the target translation services.
Step 2
Datasets
Resource Development

The researchers gathered similar datasets that the target translation services used.
Step 3
Models
Resource Development

The researchers gathered similar model architectures that the target translation services used.
Step 4
AI Model Inference API Access
AI Model Access

They abused a public facing application to query the model and produced machine translated sentence pairs as training data.
Step 5
Train Proxy via Replication
AI Attack Staging

Using these translated sentence pairs, the researchers trained a model that replicates the behavior of the target model.
Step 6
AI Intellectual Property Theft
Impact

By replicating the model with high fidelity, the researchers demonstrated that an adversary could steal a model and violate the victim's intellectual property rights.
Step 7
Black-Box Transfer
AI Attack Staging

The replicated models were used to generate adversarial examples that successfully transferred to the black-box translation services.
Step 8
Evade AI Model
Impact

The adversarial examples were used to evade the machine translation services by a variety of means. This included targeted word flips, vulgar outputs, and dropped sentences.
Step 9
Erode AI Model Integrity
Impact

Adversarial attacks can cause errors that cause reputational damage to the company of the translation service and decrease user trust in AI-powered services.

Mitigations

Defenses connected to the attack methods in this case.

Sources

Original public records and references for this case.

Original source

Original source links

Open the MITRE ATLAS data and public references used for this case study.

Repositoryhttps://github.com/mitre-atlas/atlas-data ATLAS.yamlhttps://github.com/mitre-atlas/atlas-data/blob/main/dist/ATLAS.yaml Schemahttps://github.com/mitre-atlas/atlas-data/blob/main/dist/schemas/atlas_output_schema.json Wallace, Eric, et al. "Imitation Attacks and Defenses for Black-box Machine Translation Systems" EMNLP 2020https://arxiv.org/abs/2004.15015 Project Page, "Imitation Attacks and Defenses for Black-box Machine Translation Systems"https://www.ericswallace.com/imitation Google under fire for mistranslating Chinese amid Hong Kong protestshttps://thehill.com/policy/international/asia-pacific/449164-google-under-fire-for-mistranslating-chinese-amid-hong-kong/