category
AI Risks
Common risks that can happen when AI systems are built, deployed, or used.
Showing 501-520 of 1686 records
Dystopian trajectory lock-in because of misuse of advanced AI to establish and/or maintain totalitarian regimes; is an AI risk in 6. Socioeconomic and Enviro...
Existential disaster because of conflict between AI systems and multi-system interactions is an AI risk in 7. AI System Safety, Failures, & Limitations focus...
Extreme “suffering risks” because of a misaligned system is an AI risk in 7. AI System Safety, Failures, & Limitations focused on 7.1 > AI pursuing its own g...
Gradual, irretrievable ceding of human power over the future to AI systems is an AI risk in 5. Human-Computer Interaction focused on 5.2 > Loss of human agen...
Existential disaster because of misaligned superintelligence or power-seeking AI is an AI risk in 7. AI System Safety, Failures, & Limitations focused on 7.1...
Deception is an AI risk in 7. AI System Safety, Failures, & Limitations focused on 7.2 > AI possessing dangerous capabilities. It is most relevant during 2 -...
Anonymous resource acquisition is an AI risk in 7. AI System Safety, Failures, & Limitations focused on 7.2 > AI possessing dangerous capabilities. It is mos...
Autonomous replication is an AI risk in 7. AI System Safety, Failures, & Limitations focused on 7.2 > AI possessing dangerous capabilities. It is most releva...
Self-improvement is an AI risk in 7. AI System Safety, Failures, & Limitations focused on 7.2 > AI possessing dangerous capabilities. It is most relevant dur...
Acquisition of goals to seek power and control is an AI risk in 7. AI System Safety, Failures, & Limitations focused on 7.1 > AI pursuing its own goals in co...
Situational awareness is an AI risk in 7. AI System Safety, Failures, & Limitations focused on 7.2 > AI possessing dangerous capabilities. It is most relevan...
Dangerous capabilities in AI systems is an AI risk in 7. AI System Safety, Failures, & Limitations focused on 7.2 > AI possessing dangerous capabilities. It...
Harms from increasingly agentic algorithmic systems is an AI risk in 7. AI System Safety, Failures, & Limitations focused on 7.2 > AI possessing dangerous ca...
Language model misalignment is an AI risk in 7. AI System Safety, Failures, & Limitations focused on 7.1 > AI pursuing its own goals in conflict with human g...
Inner misalignment is an AI risk in 7. AI System Safety, Failures, & Limitations focused on 7.1 > AI pursuing its own goals in conflict with human goals or v...
Goal misgeneralization is an AI risk in 7. AI System Safety, Failures, & Limitations focused on 7.1 > AI pursuing its own goals in conflict with human goals...
Instrumental convergence is an AI risk in 7. AI System Safety, Failures, & Limitations focused on 7.1 > AI pursuing its own goals in conflict with human goal...
Reward model overoptimization is an AI risk in 7. AI System Safety, Failures, & Limitations focused on 7.1 > AI pursuing its own goals in conflict with human...
Specification gaming is an AI risk in 7. AI System Safety, Failures, & Limitations focused on 7.1 > AI pursuing its own goals in conflict with human goals or...
Faulty reward functions in the wild is an AI risk in 7. AI System Safety, Failures, & Limitations focused on 7.1 > AI pursuing its own goals in conflict with...