Search
Find a vulnerability
Search criteria
ⓘ
Use this form to refine search results.
Full-text search supports keyword queries with ranking and filtering.
You can combine vendor, product, and sources to narrow results.
Enable “Apply ordering” to sort by date instead of relevance.
1 vulnerability by OpenAI GPT-2
AVID-2023-V008
Vulnerability from avid – Published: 2023-03-31 – Updated: 2023-03-31 ATLAS Case StudySummary
OpenAI built GPT-2, a language model capable of generating high quality text samples. Over concerns that GPT-2 could be used for malicious purposes such as impersonating others, or generating misleading news articles, fake social media content, or spam, OpenAI adopted a tiered release schedule. They initially released a smaller, less powerful version of GPT-2 along with a technical description of the approach, but held back the full trained model.
Before the full model was released by OpenAI, researchers at Brown University successfully replicated the model using information released by OpenAI and open source ML artifacts. This demonstrates that a bad actor with sufficient technical skill and compute resources could have replicated GPT-2 and used it for harmful goals before the AI Security community is prepared.
Risk domain
Security
SEP view
S0502: Model theft
Lifecycle
L04: Model Development, L06: Deployment
Organisations
OpenAI GPT-2 (deployer)
Affected artifacts
1 artifact
| Artifact | Type |
|---|---|
| OpenAI GPT-2 | System |
References
3 references
| URL | Label |
|---|---|
| https://atlas.mitre.org/studies/AML.CS0007 | GPT-2 Model Replication |
| https://www.wired.com/story/dangerous-ai-open-source/ | Wired Article, "OpenAI Said Its Code Was Risky. Two Grads Re-Created It Anyway" |
| https://blog.usejournal.com/opengpt-2-we-replicat… | Medium BlogPost, "OpenGPT-2: We Replicated GPT-2 Because You Can Too" |