Record summary
A quick snapshot of what this page covers.
Risk profile
How this risk is described and categorized.
The controversial views expressed by large models are also a widely discussed concern. Bang et al. (2021) evaluated several large models and found that they occasionally express inappropriate or extremist views when discussing political top-ics. Furthermore, models like ChatGPT (OpenAI, 2022) that claim political neutrality and aim to provide objective information for users have been shown to exhibit notable left-leaning political biases in areas like economics, social policy, foreign affairs, and civil liberties.
Suggested mitigations
Defenses that may help with related attacks.
Source
Research source for this risk, when available.
Included resource
Towards Safer Generative Language Models: A Survey on Safety Risks, Evaluations, and Improvements
Original source
MIT AI Risk Repository
Open the public repository used for AI risk records and taxonomy fields.
