Record summary
A quick snapshot of what this page covers.
Risk profile
How this risk is described and categorized.
"These risks arise from the LM outputting false, misleading, nonsensical or poor quality information, without malicious intent of the user. (The deliberate generation of "disinformation", false information that is intended to mislead, is discussed in the section on Malicious Uses.) Resulting harms range from unintentionally misinforming or deceiving a person, to causing material harm, and amplifying the erosion of societal distrust in shared information. Several risks listed here are well-documented in current large-scale LMs as well as in other language technologies"
Suggested mitigations
Defenses that may help with related attacks.
Source
Research source for this risk, when available.
Included resource
Taxonomy of Risks posed by Language Models
Original source
MIT AI Risk Repository
Open the public repository used for AI risk records and taxonomy fields.