category

AI Risks

Common risks that can happen when AI systems are built, deployed, or used.

Showing 501-520 of 1686 records

Dystopian trajectory lock-in because of misuse of advanced AI to establish and/or maintain totalitarian regimes;

Dystopian trajectory lock-in because of misuse of advanced AI to establish and/or maintain totalitarian regimes; is an AI risk in 6. Socioeconomic and Enviro...

Existential disaster because of conflict between AI systems and multi-system interactions

Existential disaster because of conflict between AI systems and multi-system interactions is an AI risk in 7. AI System Safety, Failures, & Limitations focus...

Extreme “suffering risks” because of a misaligned system

Extreme “suffering risks” because of a misaligned system is an AI risk in 7. AI System Safety, Failures, & Limitations focused on 7.1 > AI pursuing its own g...

Gradual, irretrievable ceding of human power over the future to AI systems

Gradual, irretrievable ceding of human power over the future to AI systems is an AI risk in 5. Human-Computer Interaction focused on 5.2 > Loss of human agen...

Existential disaster because of misaligned superintelligence or power-seeking AI

Existential disaster because of misaligned superintelligence or power-seeking AI is an AI risk in 7. AI System Safety, Failures, & Limitations focused on 7.1...

Deception

Deception is an AI risk in 7. AI System Safety, Failures, & Limitations focused on 7.2 > AI possessing dangerous capabilities. It is most relevant during 2 -...

Anonymous resource acquisition

Anonymous resource acquisition is an AI risk in 7. AI System Safety, Failures, & Limitations focused on 7.2 > AI possessing dangerous capabilities. It is mos...

Autonomous replication

Autonomous replication is an AI risk in 7. AI System Safety, Failures, & Limitations focused on 7.2 > AI possessing dangerous capabilities. It is most releva...

Self-improvement

Self-improvement is an AI risk in 7. AI System Safety, Failures, & Limitations focused on 7.2 > AI possessing dangerous capabilities. It is most relevant dur...

Acquisition of goals to seek power and control

Acquisition of goals to seek power and control is an AI risk in 7. AI System Safety, Failures, & Limitations focused on 7.1 > AI pursuing its own goals in co...

Situational awareness

Situational awareness is an AI risk in 7. AI System Safety, Failures, & Limitations focused on 7.2 > AI possessing dangerous capabilities. It is most relevan...

Dangerous capabilities in AI systems

Dangerous capabilities in AI systems is an AI risk in 7. AI System Safety, Failures, & Limitations focused on 7.2 > AI possessing dangerous capabilities. It...

Harms from increasingly agentic algorithmic systems

Harms from increasingly agentic algorithmic systems is an AI risk in 7. AI System Safety, Failures, & Limitations focused on 7.2 > AI possessing dangerous ca...

Language model misalignment

Language model misalignment is an AI risk in 7. AI System Safety, Failures, & Limitations focused on 7.1 > AI pursuing its own goals in conflict with human g...

Inner misalignment

Inner misalignment is an AI risk in 7. AI System Safety, Failures, & Limitations focused on 7.1 > AI pursuing its own goals in conflict with human goals or v...

Goal misgeneralization

Goal misgeneralization is an AI risk in 7. AI System Safety, Failures, & Limitations focused on 7.1 > AI pursuing its own goals in conflict with human goals...

Instrumental convergence

Instrumental convergence is an AI risk in 7. AI System Safety, Failures, & Limitations focused on 7.1 > AI pursuing its own goals in conflict with human goal...

Reward model overoptimization

Reward model overoptimization is an AI risk in 7. AI System Safety, Failures, & Limitations focused on 7.1 > AI pursuing its own goals in conflict with human...

Specification gaming

Specification gaming is an AI risk in 7. AI System Safety, Failures, & Limitations focused on 7.1 > AI pursuing its own goals in conflict with human goals or...

Faulty reward functions in the wild

Faulty reward functions in the wild is an AI risk in 7. AI System Safety, Failures, & Limitations focused on 7.1 > AI pursuing its own goals in conflict with...