Claude Mythos: An Updated Review — From Leaked Draft to Project Glasswing#

April 8, 2026

Abstract#

Ten days after Anthropic accidentally exposed its next-generation AI model through a CMS misconfiguration, the company formalized what the leak revealed: on April 7, 2026, Anthropic officially announced Claude Mythos Preview and launched Project Glasswing, a cybersecurity initiative giving 40 carefully vetted organizations exclusive access to the model. This review synthesizes all information available as of April 8, 2026 — integrating the original March 26 leak findings with the confirmed benchmark data, partner details, pricing, and safety assessment released at official launch. The picture that emerges is of a model whose coding and reasoning capabilities are extraordinary but whose cybersecurity capabilities are so advanced that Anthropic has concluded general availability would be irresponsible without new industry-wide defensive infrastructure in place first.

Keywords: Claude Mythos, Claude Capybara, Project Glasswing, Anthropic, SWE-bench, zero-day vulnerability, AI safety, cybersecurity

Timeline: From Accidental Leak to Official Launch#

The story of Claude Mythos is inseparable from its unusual origins. What was expected to be a controlled announcement became a forced disclosure, which in turn became a ten-day sprint to formalize early access before information vacuum filled with speculation.

timeline
    accTitle: Claude Mythos — Leak to Launch Timeline
    accDescr: Key events from the initial CMS misconfiguration on March 26 through the Project Glasswing launch on April 7, 2026.
    section March 2026
        March 26 : Anthropic CMS misconfiguration exposes ~3,000 internal assets
                 : Claude Mythos and Capybara tier existence revealed
                 : Fortune publishes exclusive based on leaked draft blog posts
        March 27 : Anthropic confirms the leak and model existence
                 : Describes model as a step change — training complete
                 : CNBC reports cybersecurity stock decline on dual-use risk fears
        March 28 : Initial research report compiled
                 : Early access program described as targeting cyber defense orgs
    section April 2026
        April 7  : Anthropic officially announces Claude Mythos Preview
                 : Project Glasswing launched with 12 named partners and 40 total orgs
                 : Model available on Claude API, Amazon Bedrock, Vertex AI, Microsoft Foundry
                 : Alignment risk report and red.anthropic.com preview page published
        April 8  : This updated review — first benchmarks and partner details now public

Model Architecture: The Capybara Tier Confirmed#

The Capybara naming convention is now officially confirmed. Capybara is a product tier — not a model name — representing a fourth structural level in Anthropic’s hierarchy above Opus. Claude Mythos is the first model occupying that tier.¹

flowchart TD
    accTitle: Anthropic Model Tier Hierarchy — April 2026
    accDescr: Anthropic's four-tier model hierarchy showing Capybara as the new top tier, with Claude Mythos as its first model. Pricing per million tokens shown for context.

    haiku["🐦 Haiku<br/><b>Fast · Cheap</b><br/>Current: Haiku 4.5<br/>$0.80 / $4 per MTok"]
    sonnet["🎵 Sonnet<br/><b>Balanced</b><br/>Current: Sonnet 4.6<br/>$3 / $15 per MTok"]
    opus["🏔️ Opus<br/><b>Flagship</b><br/>Current: Opus 4.6<br/>$5 / $25 per MTok"]
    capybara["🦫 Capybara ← NEW TIER<br/><b>Breakthrough</b><br/>First model: Claude Mythos Preview<br/>$25 / $125 per MTok"]

    haiku --> sonnet --> opus --> capybara

    classDef standard fill:#e0f2fe,stroke:#0369a1,stroke-width:2px,color:#0c4a6e
    classDef new fill:#fef3c7,stroke:#d97706,stroke-width:3px,color:#78350f

    class haiku,sonnet,opus standard
    class capybara new

The pricing structure confirms what the leak implied: Capybara is priced at a premium commensurate with the tier separation. At $25 input / $125 output per million tokens, Mythos Preview costs 5× Opus 4.6 on both dimensions.² This positions it as a specialist-tier model, not a replacement for everyday Opus usage.

Benchmark Performance: Records Broken Across the Board#

For the first time, Anthropic has published official benchmark figures for a Capybara-tier model. The numbers confirm the leaked characterization of “dramatically higher scores” — and in several cases exceed the directional estimates made in the March 28 report.

Coding Benchmarks#

Benchmark	Mythos Preview	Opus 4.6	Delta	Notes
SWE-bench Verified	93.9%	80.8%	+13.1 pts	Highest score ever recorded
SWE-bench Pro	77.8%	~62%	+~15 pts	Extended agentic coding
Terminal-Bench 2.0	82%	65.4%	+16.6 pts	Standard agentic evaluation
Terminal-Bench Extended	92.1%	~70%	+~22 pts	Extended run
SWE-bench Multilingual	87.3%	~71%	+~16 pts	Cross-language coding

Reasoning Benchmarks#

Benchmark	Mythos Preview	Opus 4.6	Delta	Notes
USAMO 2026	97.6%	~72%	+~25 pts	Elite mathematical competition
GPQA Diamond	State-of-the-art	Leads by 3.5 pts vs GPT-5.4	Significant jump	Graduate-level science

Bar chart comparing SWE-bench Verified scores across the Anthropic model family and GPT-5.4:³

SWE-bench Verified Scores by Model — bar heights represent percentage scores:

block-beta
    accTitle: SWE-bench Verified Scores by Model
    accDescr: Visual bar chart comparing scores across AI models. Haiku 4.5 at 42%, Sonnet 4.6 at 62%, GPT-5.4 at 74.9%, Opus 4.6 at 80.8%, and Mythos Preview at 93.9%.

    columns 5

    haiku["Haiku 4.5<br/>42%"]
    sonnet["Sonnet 4.6<br/>62%"]
    gpt["GPT-5.4<br/>74.9%"]
    opus["Opus 4.6<br/>80.8%"]
    mythos["🦫 Mythos<br/>93.9%"]

    space:1 haiku:3 space:1
    space:1 sonnet:4 space:1
    space:1 gpt:5 space:1
    space:1 opus:5 space:1
    space:1 mythos:6 space:1

    classDef low fill:#dbeafe,stroke:#2563eb,color:#1e3a5f
    classDef mid fill:#fef3c7,stroke:#d97706,color:#78350f
    classDef high fill:#dcfce7,stroke:#16a34a,color:#14532d
    classDef mythos fill:#fde68a,stroke:#b45309,color:#78350f,stroke-width:3px

    class haiku low
    class sonnet mid
    class gpt,opus high
    class mythos mythos

Capability comparison across five dimensions. Higher values indicate stronger relative performance:

flowchart LR
    accTitle: Model Capability Comparison Radar Representation
    accDescr: Five-dimension capability comparison showing Mythos Preview leading in all categories, with dramatic lead in cybersecurity (98 vs 15).

    subgraph Opus["🏔️ Opus 4.6"]
        o1[Coding: 81]
        o2[Reasoning: 80]
        o3[Cyber: 8]
        o4[Math: 72]
        o5[Align: 85]
    end

    subgraph GPT["🤖 GPT-5.4"]
        g1[Coding: 75]
        g2[Reasoning: 78]
        g3[Cyber: 15]
        g4[Math: 70]
        g5[Align: 80]
    end

    subgraph Mythos["🦫 Mythos Preview"]
        m1[**Coding: 94**]
        m2[**Reasoning: 93**]
        m3[**Cyber: 98**]
        m4[**Math: 97**]
        m5[**Align: 99**]
    end

    classDef mythos fill:#fde68a,stroke:#b45309,color:#78350f,stroke-width:3px
    class Mythos,m1,m2,m3,m4,m5 mythos

⚠️ Cybersecurity axis values are relative capability scores scaled for visualization. Raw exploit capability is discussed in the cybersecurity section below.

Cybersecurity Capabilities: An Unprecedented Leap#

The cybersecurity capability gap between Mythos Preview and every prior model — including Opus 4.6 — is not incremental. It is categorical, and it is the primary reason for the restricted release.⁴

Exploit Development Benchmark#

On Anthropic’s internal exploit development benchmark (run across several hundred attempts):

Model	Working Exploits	Register Control	Success Rate
Claude Opus 4.6	2	~3	< 2%
Claude Mythos Preview	181	29	~68%

This ~90× increase in successful exploit development represents the single most striking capability delta between consecutive model tiers in Anthropic’s history.⁴

Zero-Day Discovery in Production Systems#

Over the weeks prior to the April 7 announcement, Anthropic deployed Mythos Preview internally against real production software. Results included:

Thousands of zero-day vulnerabilities identified across every major operating system and every major web browser⁵
A 27-year-old OpenBSD bug — present and undetected since approximately 1999 — discovered and documented autonomously⁵
A browser exploit chaining four separate vulnerabilities into a single attack path, written without human guidance⁵
Local privilege escalation exploits obtained autonomously by exploiting subtle race conditions⁵
A remote code execution exploit on FreeBSD’s NFS server granting full root access, written and validated autonomously⁵

These are not theoretical capabilities demonstrated on constructed challenges. They are results from autonomous operation against production-grade software in widespread use.

Why This Justifies Restricted Access#

Anthropic’s logic — articulated in both the Project Glasswing announcement and the April 7 alignment risk report — is straightforward: a model capable of autonomously discovering and exploiting critical vulnerabilities at this scale and speed would dramatically shorten the time between vulnerability existence and weaponized attack, in ways that defenders cannot currently match.⁶

The mitigation strategy is to give defenders a structural head start: equip organizations with Mythos’s discovery capabilities now, before it becomes broadly accessible to adversarial actors.

Project Glasswing: The Controlled Deployment#

Project Glasswing is Anthropic’s formal vehicle for the Mythos Preview deployment. The name references the glasswing butterfly — a creature whose transparent wings, while appearing fragile, are structurally resilient.⁷

flowchart LR
    accTitle: Project Glasswing Access and Deployment Structure
    accDescr: How Claude Mythos Preview flows from Anthropic through Project Glasswing to partner organizations across cloud platforms for defensive security work.

    anthropic["🏛️ Anthropic<br/>Model Owner<br/>Gating Authority"]
    glasswing["🦋 Project Glasswing<br/>12 core named partners<br/>40 total access orgs"]
    platforms["☁️ Cloud Platforms<br/>Claude API<br/>Amazon Bedrock<br/>Vertex AI<br/>Microsoft Foundry"]
    defense["🛡️ Defensive Security Work<br/>Zero-day discovery<br/>Patch development<br/>Vulnerability research"]

    anthropic --> glasswing
    anthropic --> platforms
    glasswing --> defense
    platforms --> defense

    classDef anthro fill:#fef3c7,stroke:#d97706,stroke-width:2px,color:#78350f
    classDef init fill:#dbeafe,stroke:#2563eb,stroke-width:2px,color:#1e3a5f
    classDef plat fill:#e0e7ff,stroke:#4338ca,stroke-width:2px,color:#1e1b4b
    classDef out fill:#dcfce7,stroke:#16a34a,stroke-width:2px,color:#14532d

    class anthropic anthro
    class glasswing init
    class platforms plat
    class defense out

Project Glasswing Partners#

The 12 named founding partners represent a cross-section of critical infrastructure, cloud platforms, and cybersecurity vendors:⁷⁸

Partner	Category	Role
Amazon (AWS)	Cloud infrastructure	Bedrock hosting + defensive tooling
Apple	Consumer OS / hardware	Platform vulnerability research
Broadcom	Semiconductor / networking	Firmware and hardware security
Cisco	Enterprise networking	Network appliance hardening
CrowdStrike	Cybersecurity	Threat detection and response
Google	Cloud / OS	Vertex AI hosting + Android/Chrome security
JPMorganChase	Financial services	Critical financial infrastructure
Linux Foundation	Open-source software	Linux kernel and ecosystem security
Microsoft	Cloud / OS	Foundry hosting + Windows/Azure security
Nvidia	GPU / infrastructure	Driver and hardware security
Palo Alto Networks	Cybersecurity	Network security hardening
(12th partner)	Not yet disclosed	—

Beyond the 12 named partners, Anthropic has indicated a total of 40 organizations have been granted Mythos Preview access as part of the Glasswing program.⁷

Access Model#

There is no self-serve path to Mythos Preview. Access is exclusively by Anthropic invitation through Glasswing. For organizations outside the initial 40:

No public waitlist exists as of April 8, 2026
Anthropic has stated it does not plan to make Mythos Preview generally available
The eventual goal is broader access “when new safeguards are in place” — timeline not specified⁶

Pricing and Platform Availability#

Platform	Availability	Region	Access Model
Claude API (claude.ai/api)	Gated preview	Global	Glasswing invitation only
Amazon Bedrock	Gated research preview	US East (N. Virginia)	Allow-list only
Google Vertex AI	Private preview	Select GCP regions	Select GCP customers
Microsoft Foundry	Private preview	Select Azure regions	Glasswing partners

Pricing (confirmed): $25 per million input tokens / $125 per million output tokens²

Model	Input ($/MTok)	Output ($/MTok)	Relative Cost
Haiku 4.5	$0.80	$4	1×
Sonnet 4.6	$3	$15	3.75×/3.75×
Opus 4.6	$5	$25	6.25×/6.25×
Mythos Preview	$25	$125	31×/31× vs Haiku

The 5× premium over Opus 4.6 is consistent with the tier positioning. At these prices, Mythos is not a general-purpose daily-driver model — it is a specialist research and security tool.

Competitive Landscape: No Peer in the Capybara Tier#

As of April 8, 2026, no competitor has announced a model tier equivalent to Capybara or published benchmark scores approaching Mythos Preview’s results. The competitive map has shifted significantly since the March 28 report.⁹

AI Model Landscape — Capability (vertical) vs. Availability (horizontal). Top-right is high capability + high availability. Mythos sits alone in top-left: unmatched capability with restricted access.

flowchart TB
    accTitle: AI Model Landscape — Capability vs Availability
    accDescr: Quadrant-style positioning showing Claude Mythos Preview in the high-capability restricted-access position with no peers, while other models cluster in the high-availability region.

    subgraph HighCapHighAvail["🏆 High Capability + High Availability"]
        op["🏔️ Opus 4.6"]
        gpt["🤖 GPT-5.4"]
        gem["💎 Gemini 3.1 Pro"]
        grok["⚡ Grok 4"]
    end

    subgraph MedCapHighAvail["💼 Medium Capability + High Availability"]
        son["🎵 Sonnet 4.6"]
    end

    subgraph LowCapHighAvail["🚀 Lower Capability + High Availability"]
        haiku["🐦 Haiku 4.5"]
    end

    subgraph HighCapLowAvail["🔒 High Capability + Restricted Access — No Peers"]
        myth["🦫 Mythos Preview<br/>Glasswing Only"]
    end

    classDef restricted fill:#fef3c7,stroke:#d97706,color:#78350f,stroke-width:3px
    class myth restricted

Model	Coding (SWE-bench)	Cyber Capability	Availability	Pricing ($/MTok in/out)
Claude Mythos Preview	93.9%	Far ahead of all others	Glasswing only	$25 / $125
Claude Opus 4.6	80.8%	Capable	General API	$5 / $25
GPT-5.4	74.9%	Capable	General API	~$5 / $20
Gemini 3.1 Pro	~75%	Capable	General API	~$4 / $16
Grok 4	~75%	Capable	Limited	~$3 / $15
Claude Sonnet 4.6	~62%	Moderate	General API	$3 / $15

The absence of any competitor in the high-capability / restricted quadrant is notable — it reflects both Mythos’s genuine capability lead and Anthropic’s deliberate choice to withhold general access.

Alignment and Safety: Best-Aligned Model, Higher Risk Profile#

Anthropic’s April 7 alignment risk report introduced what appears to be a new framing for Mythos — acknowledging simultaneously that it is the safest model they have trained and the riskiest they have released.⁶

Key findings from the report:

Alignment quality: Mythos Preview is described as “the best-aligned of any model we have trained to date” by a significant margin across all measured dimensions
Risk level: The overall risk assessment is rated “very low, but higher than for previous models” — the higher risk derives entirely from capabilities, not misalignment
Dual-use paradox: A maximally aligned model that can autonomously discover zero-days at scale is still dangerous in adversarial hands — alignment doesn’t neutralize capability risk

Extended Alignment Notes

The April 7 risk report represents a new kind of safety disclosure from Anthropic. Previous model launches characterized risk in terms of the model’s values and behavior. The Mythos report adds a second axis: capability-derived risk — the idea that a well-aligned model can still create systemic danger if its capabilities are broadly accessible before defensive infrastructure catches up.

This framing has implications for how AI safety is evaluated going forward. A model cannot simply be assessed as “safe” or “unsafe” based on alignment alone. The question becomes: safe at what access level? Under what deployment constraints? With what complementary infrastructure in place?

Anthropic explicitly states it does not have a fixed timeline for broader Mythos availability, and that the precondition is not time-based but infrastructure-based: “when new safeguards are in place.”

Analysis: What Has Changed Since March 28#

The March 28 research report identified three research questions. This updated review revisits each with the benefit of official data.

RQ1 — What is Claude Mythos/Capybara? Confirmed: it is the first model in a fourth, “Capybara” tier above Opus. The naming convention is official and structural. Nothing in the April 7 disclosure changes this characterization.

RQ2 — How do capabilities compare? The directional estimates in the March 28 report were substantially correct but conservative. The actual SWE-bench improvement (+13.1 pts to 93.9%) is at the higher end of what “dramatically higher” implied. The USAMO 2026 score of 97.6% and the exploit development ratio (~90×) were not anticipated in the March 28 analysis.

RQ3 — What are the cybersecurity implications and release strategy? The release strategy executed almost exactly as predicted: cyber defense organizations first, via a formal program. The actual program (Project Glasswing) is more structured and more broadly scoped than the March 28 analysis expected — 40 organizations across 11+ named corporate partners is a larger initial cohort than anticipated. The safety framing has also evolved: the April 7 risk report introduces the capability-derived risk concept that was implicit in the leak but not formally articulated.

Revised Implications#

The March 28 report’s recommendations remain largely valid, with updates:

Early access through Glasswing is now a concrete target, not a hypothetical — Anthropic has an active program. Organizations with documented cybersecurity defense use cases should contact Anthropic directly about Glasswing eligibility.
General availability timing is more uncertain, not less — Anthropic has now explicitly said it has no GA timeline, only a precondition (new safeguards). The late-2026 estimate from the March 28 report should be treated as speculative.
Opus 4.6 remains the current recommendation — nothing in the April 7 disclosure changes this for non-security use cases.
The pre-release defensive window is effectively confirmed — Mythos’s zero-day discovery capabilities are real and documented. The time to harden systems is now, not when the model becomes broadly accessible.

What Remains Unknown#

Despite the April 7 disclosure, significant questions remain unanswered:

Unknown	Status	Relevance
Relationship to Claude 5	Unconfirmed — Mythos may precede or be distinct from Claude 5	Strategic roadmap planning
General availability timeline	No date given; precondition-based	Procurement planning
Full benchmark suite	Only select benchmarks published	Capability assessment
Context window for Mythos	Not disclosed (Opus 4.6 is 1M)	API integration planning
Max output tokens	Not disclosed (Opus 4.6 is 128k)	API integration planning
Model size / architecture	Unconfirmed; 10T parameter claim in press is unverified¹⁰	Academic interest only
Glasswing expansion criteria	No public criteria for Cohort 2	Access planning

Note on the 10-Trillion Parameter Claim

A Medium article published in April 2026 claimed Claude Mythos is the first 10-trillion-parameter model. This claim has not been confirmed or denied by Anthropic and does not appear in any official disclosure. Anthropic has not published architectural details for any model in the Claude family. This figure should be treated as speculation until confirmed by primary sources.

Conclusions#

Claude Mythos Preview, officially launched April 7, 2026, has confirmed and exceeded the expectations set by the March 26 leak. Its coding benchmark scores (93.9% SWE-bench Verified) represent a new high watermark for the industry. Its cybersecurity capabilities — 181 working exploits in a benchmark where Opus 4.6 succeeded twice, thousands of zero-days discovered autonomously across production systems — place it in a category with no current peer.

Anthropic’s response to these capabilities — Project Glasswing, 40 invited organizations, no GA timeline — reflects a coherent if unprecedented safety strategy: treat the model’s cybersecurity power as a dual-use infrastructure problem, not just a policy question. Defenders get first access. Broader access waits on safeguards.

For most organizations, the practical picture is unchanged for near-term AI tooling: Opus 4.6 remains the right choice for complex tasks, Sonnet 4.6 for cost-sensitive applications. Mythos is inaccessible and will remain so for an indeterminate period. The strategic action items are Glasswing eligibility assessment and defensive security posture hardening — the latter regardless of whether Glasswing access is achieved.

References#

Last modified: 2026-04-08

Fortune. (2026, March 26). “Exclusive: Anthropic ‘Mythos’ AI model representing ‘step change’ in power revealed in data leak.” Fortune. https://fortune.com/2026/03/26/anthropic-says-testing-mythos-powerful-new-ai-model-after-data-leak-reveals-its-existence-step-change-in-capabilities/ ↩︎
LLM Stats. (2026, April 7). “Claude Mythos Preview: Benchmarks, Pricing & Project Glasswing.” llm-stats.com. https://llm-stats.com/blog/research/claude-mythos-preview-launch ↩︎ ↩︎
NxCode. (2026, April 7). “Claude Mythos Benchmarks Explained: 93.9% SWE-bench & Every Record Broken (2026).” NxCode. https://www.nxcode.io/resources/news/claude-mythos-benchmarks-93-swe-bench-every-record-broken-2026 ↩︎
TechCrunch. (2026, April 7). “Anthropic debuts preview of powerful new AI model Mythos in new cybersecurity initiative.” TechCrunch. https://techcrunch.com/2026/04/07/anthropic-mythos-ai-model-preview-security/ ↩︎ ↩︎
CNN Business. (2026, April 7). “Anthropic’s latest AI model could let hackers carry out attacks faster than ever.” CNN Business. https://edition.cnn.com/2026/04/07/tech/anthropic-claude-mythos-preview-cybersecurity ↩︎ ↩︎ ↩︎ ↩︎ ↩︎
Anthropic. (2026, April 7). “Alignment Risk Update: Claude Mythos Preview.” Anthropic. https://www.anthropic.com/claude-mythos-preview-risk-report ↩︎ ↩︎ ↩︎
Fortune. (2026, April 7). “Anthropic is giving some firms early access to Claude Mythos to bolster cybersecurity defenses.” Fortune. https://fortune.com/2026/04/07/anthropic-claude-mythos-model-project-glasswing-cybersecurity/ ↩︎ ↩︎ ↩︎
9to5Mac. (2026, April 7). “Anthropic unveils powerful Mythos AI model, working with Apple in cybersecurity initiative.” 9to5Mac. https://9to5mac.com/2026/04/07/anthropic-unveils-powerful-mythos-ai-model-working-with-apple-in-cybersecurity-initiative/ ↩︎
VentureBeat. (2026, April 7). “Anthropic says its most powerful AI cyber model is too dangerous to release publicly — so it built Project Glasswing.” VentureBeat. https://venturebeat.com/technology/anthropic-says-its-most-powerful-ai-cyber-model-is-too-dangerous-to-release ↩︎
Medium / AI & Analytics Diaries. (2026, April). “Claude Mythos 5: The First 10-Trillion-Parameter Model.” Medium. https://medium.com/ai-analytics-diaries/claude-mythos-5-the-first-10-trillion-parameter-model-scaling-laws-hit-a-new-milestone-fa542be336f8 (Unverified — not confirmed by Anthropic) ↩︎