Record summary
A quick snapshot of what this page covers.
Risk profile
How this risk is described and categorized.
"Agents that reason about their own computational resources and logically uncertain events can encounter strange paradoxes due to Godelian limitations (Fallenstein and Soares, 2015; Soares and Fallenstein, 2014, 2017) and shortcomings of probability theory (Soares and Fallenstein, 2014, 2015, 2017). They may also be reflectively unstable, preferring to change the principles by which they select actions (Arbital, 2018)."
Suggested mitigations
Defenses that may help with related attacks.
Restrict Number of AI Model Queries
Control Access to AI Models and Data in Production
Source
Research source for this risk, when available.
Included resource
AGI Safety Literature Review
Original source
MIT AI Risk Repository
Open the public repository used for AI risk records and taxonomy fields.
