PromptRiskDBThreat intelligence atlas
AI Risk

General Evaluations (Biased evaluations of encoded human values)

"Encoded human values in AI models that are easier to evaluate might be preferred for inclusion in evaluations over those that are more difficult to measure [13]. This might come at the expense of more desirable but harder-to-quantify values. This bias can lead to an imbalance, where easier-to-measure values dominate the evaluation process, while other important values are underrepresented."

AI Risk6. Socioeconomic and Environmental6.5 > Governance failure1 - Pre-deployment

Record summary

A quick snapshot of what this page covers.

Techniques0Attack methods connected to this risk.
Mitigations0Defenses that may help with related attacks.
Domain6. Socioeconomic and EnvironmentalThe broad risk area this belongs to.

Risk profile

How this risk is described and categorized.

Domain6. Socioeconomic and Environmental
Subdomain6.5 > Governance failure
Entity1 - Human
Intent2 - Unintentional
Timing1 - Pre-deployment
CategoryModel Evaluations
SubcategoryGeneral Evaluations (Biased evaluations of encoded human values)

Suggested mitigations

Defenses that may help with related attacks.

No propagated mitigations. No defense is available through the connected attack methods.

Source

Research source for this risk, when available.