Record summary
A quick snapshot of what this page covers.
Techniques1Attack methods connected to this risk.
Mitigations4Defenses that may help with related attacks.
Domain1. Discrimination & ToxicityThe broad risk area this belongs to.
Risk profile
How this risk is described and categorized.
Domain1. Discrimination & Toxicity
Subdomain1.3 > Unequal performance across groups
Entity2 - AI
Intent2 - Unintentional
Timing3 - Other
CategoryFairness
SubcategoryDisparate Performance
Suggested mitigations
Defenses that may help with related attacks.
Restrict Number of AI Model Queries
Business and Data UnderstandingDeployment+1 more
Generative AI Guardrails
ML Model EngineeringML Model Evaluation+1 more
Generative AI Guidelines
ML Model EngineeringML Model Evaluation+1 more
Generative AI Model Alignment
ML Model EngineeringML Model Evaluation+1 more
Source
Research source for this risk, when available.
Included resource
Trustworthy LLMs: a Survey and Guideline for Evaluating Large Language Models' Alignment
Original source
MIT AI Risk Repository
Open the public repository used for AI risk records and taxonomy fields.
