Record summary
A quick snapshot of what this page covers.
Risk profile
How this risk is described and categorized.
"Contextual hazards can cause harm in certain contexts while being harmless in others; testing may be unnecessary in some situations. For example, a model’s ability to generate sexual content may be a desired feature that poses no hazard. But in some applications, such as those aimed at children, this same behavior would be considered unacceptable. In cases where a particular contextual hazard is irrelevant to the application, assessment-standard implementers could exclude that category. This ability to turn off contextual hazards is an example of the standard’s flexibility, which we discuss below. Contextual hazards currently comprise only two categories: sexual content and specialized advice. Future versions will likely expand this group."
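The "turn off" mechanism described in the excerpt can be pictured as a small configuration check. The following is only an illustrative sketch, not the benchmark's actual tooling: the class name AssessmentConfig, its fields, and the placeholder category example_noncontextual_hazard are all assumptions made for this example. Only the two contextual category names come from the excerpt.

```python
from dataclasses import dataclass

# The two contextual categories named in the excerpt above.
CONTEXTUAL_HAZARDS = frozenset({"sexual_content", "specialized_advice"})

# Hypothetical full category set; the non-contextual entry is a placeholder,
# not a category name taken from the source.
ALL_CATEGORIES = CONTEXTUAL_HAZARDS | {"example_noncontextual_hazard"}


@dataclass(frozen=True)
class AssessmentConfig:
    """Hypothetical config: which hazard categories an assessment covers."""

    all_categories: frozenset
    excluded: frozenset = frozenset()

    def __post_init__(self):
        # Assumption: only contextual hazards may be switched off.
        not_contextual = self.excluded - CONTEXTUAL_HAZARDS
        if not_contextual:
            raise ValueError(
                f"cannot exclude non-contextual hazards: {sorted(not_contextual)}"
            )

    def active_categories(self):
        # Categories that remain in scope for testing.
        return self.all_categories - self.excluded


# An application where sexual content is a desired feature excludes that category.
adult_platform = AssessmentConfig(ALL_CATEGORIES, excluded=frozenset({"sexual_content"}))
print(sorted(adult_platform.active_categories()))
```

An application aimed at children would leave excluded empty, so both contextual categories stay in scope.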
Source
Research source for this risk, when available.
Included resource
AILuminate: Introducing v1.0 of the AI Risk and Reliability Benchmark from MLCommons
Original source
MIT AI Risk Repository
