Record summary
A quick snapshot of what this page covers.
Techniques1Attack methods connected to this risk.
Mitigations4Defenses that may help with related attacks.
Domain7. AI System Safety, Failures, & LimitationsThe broad risk area this belongs to.
Risk profile
How this risk is described and categorized.
Domain7. AI System Safety, Failures, & Limitations
Subdomain7.3 > Lack of capability or robustness
Entity2 - AI
Intent2 - Unintentional
Timing2 - Post-deployment
CategoryReliability
SubcategoryInconsistency
Suggested mitigations
Defenses that may help with related attacks.
Restrict Number of AI Model Queries
Business and Data UnderstandingDeployment+1 more
Generative AI Guardrails
ML Model EngineeringML Model Evaluation+1 more
Generative AI Guidelines
ML Model EngineeringML Model Evaluation+1 more
Generative AI Model Alignment
ML Model EngineeringML Model Evaluation+1 more
Source
Research source for this risk, when available.
Included resource
Trustworthy LLMs: a Survey and Guideline for Evaluating Large Language Models' Alignment
Original source
MIT AI Risk Repository
Open the public repository used for AI risk records and taxonomy fields.
