Record summary
A quick snapshot of what this page covers.
Risk profile
How this risk is described and categorized.
"Rigidity and Mistaken Commitments. Even when it is desirable to be able to make threats in order to deter socially harmful behaviour, doing so using AI agents effectively removes the human from the loop, which could prove disastrous in high-stakes contexts (e.g., a false positive in a nuclear sub- marine’s warning system; see also Case Study 11), or when irresponsible actors are enabled in making disproportionate or mistaken commitments."
Suggested mitigations
Defenses that may help with related attacks.
Source
Research source for this risk, when available.
Included resource
Multi-Agent Risks from Advanced AI
Original source
MIT AI Risk Repository
Open the public repository used for AI risk records and taxonomy fields.
