X April 8, 2026 at 07:41 PM

2041809409802641862

2 corrections found

Claim

The report describes some very interesting examples of this behavior:

Correction

Anthropic’s official Mythos materials do not describe the four examples listed here. The Mythos announcement and Red Team post are about cybersecurity evaluations and highlight bugs/exploits in OpenBSD, FFmpeg, Linux, browsers, and FreeBSD instead.

Full reasoning

This sentence misattributes the bullet-point examples that follow to Anthropic’s Claude Mythos Preview report.

What Anthropic says the Mythos materials are about:

The official Project Glasswing announcement says Mythos Preview was used to identify zero-day vulnerabilities and that the accompanying Red Team post gives technical details for a subset of those vulnerabilities.
Anthropic then explicitly lists the example cases it is highlighting: a 27-year-old OpenBSD vulnerability, a 16-year-old FFmpeg vulnerability, and a Linux kernel exploit chain.
The separate Claude Mythos Preview Red Team post likewise says it is providing technical details on Mythos’s cybersecurity capabilities, including zero-day vulnerabilities and exploits, and gives examples like OpenBSD, FreeBSD NFS, browsers, and Linux privilege escalation.

So the Mythos report is a cybersecurity capability report, not a report centered on the four bullet-point scenarios in this post.

Anthropic does have a separate line of research on misalignment/sabotage, but that is a different publication. Anthropic’s Pilot Sabotage Risk Report says it focuses primarily on Claude Opus 4 and assesses sabotage risk in deployed models as of summer 2025. That means the post is mixing together different Anthropic publications and attributing the later bullet-point stories to the Mythos report.

In short: the official Mythos report does not contain the examples listed after this sentence; Anthropic’s own Mythos materials present a different set of cybersecurity case studies.

3 sources

Project Glasswing: Securing critical software for the AI era | Anthropic
Anthropic says the Red Team post provides technical details for patched vulnerabilities and then lists three examples: a 27-year-old OpenBSD vulnerability, a 16-year-old FFmpeg vulnerability, and a Linux kernel exploit chain.
Claude Mythos Preview | red.anthropic.com
Anthropic says the post provides technical details on Mythos Preview’s cybersecurity capabilities, including finding and exploiting zero-day vulnerabilities in real open-source codebases and turning N-day vulnerabilities into exploits.
Anthropic's Pilot Sabotage Risk Report
Anthropic says this separate report addresses misalignment risk and focuses primarily on risks involving the behavior of Claude Opus 4, not Claude Mythos Preview.

Claim

Mythos wants to help so much that Anthropic decided it's dangerous to release it.

Correction

Anthropic did not withhold Mythos entirely. It launched Project Glasswing, gave Mythos Preview access to launch partners and 40+ additional organizations, and says participants will be able to keep using it after the preview; Anthropic only said it would not be made generally available.

Full reasoning

This overstates Anthropic’s decision.

Anthropic did not decide to withhold Mythos Preview entirely. Instead, it announced a limited release through Project Glasswing:

launch partners such as AWS, Apple, Cisco, Google, Microsoft, NVIDIA, and others were given access;
Anthropic says it also extended access to more than 40 additional organizations that build or maintain critical software infrastructure;
and Anthropic says that after the research preview, Mythos Preview will be available to participants at $25/$125 per million input/output tokens.

What Anthropic actually says is that it does not plan to make Mythos Preview generally available. That is different from saying Anthropic decided it was too dangerous to release at all.

So the accurate description is a restricted, gated release rather than no release at all.

2 sources

Project Glasswing: Securing critical software for the AI era | Anthropic
Anthropic says launch partners will use Mythos Preview, that access was extended to over 40 additional organizations, and that afterward Claude Mythos Preview will be available to participants at $25/$125 per million input/output tokens.
Claude Mythos Preview | red.anthropic.com
Anthropic says: 'By releasing this model initially to a limited group of critical industry partners and open source developers with Project Glasswing, we aim to enable defenders...' and separately says it does not plan to make Mythos Preview generally available.

Model: OPENAI_GPT_5 Prompt: v1.16.0