Record summary
A quick snapshot of what this page covers.
Techniques6Attack methods connected to this risk.
Mitigations9Defenses that may help with related attacks.
Domain4. Malicious Actors & MisuseThe broad risk area this belongs to.
Risk profile
How this risk is described and categorized.
Domain4. Malicious Actors & Misuse
Subdomain4.2 > Cyberattacks, weapon development or use, and mass harm
Entity1 - Human
Intent1 - Intentional
Timing2 - Post-deployment
CategoryRisk area 4: Malicious Uses
SubcategoryAssisting code generation for cyber security threats
Suggested mitigations
Defenses that may help with related attacks.
Control Access to AI Models and Data at Rest
Business and Data UnderstandingData Preparation+2 more
Validate AI Model
ML Model EvaluationMonitoring and Maintenance
Code Signing
Deployment
Model Hardening
Data PreparationML Model Engineering
Use Ensemble Methods
ML Model Engineering
Use Multi-Modal Sensors
Business and Data UnderstandingData Preparation+1 more
Input Restoration
Data PreparationML Model Evaluation+2 more
Adversarial Input Detection
Data PreparationML Model Engineering+3 more
Deepfake Detection
DeploymentMonitoring and Maintenance+2 more
Source
Research source for this risk, when available.
Included resource
Taxonomy of Risks posed by Language Models
Original source
MIT AI Risk Repository
Open the public repository used for AI risk records and taxonomy fields.
