Claude Mythos: An Updated Review — From Leaked Draft to Project Glasswing
Claude Mythos: An Updated Review — From Leaked Draft to Project Glasswing#
April 8, 2026
Abstract#
Ten days after Anthropic accidentally exposed its next-generation AI model through a CMS misconfiguration, the company formalized what the leak revealed: on April 7, 2026, Anthropic officially announced Claude Mythos Preview and launched Project Glasswing, a cybersecurity initiative giving 40 carefully vetted organizations exclusive access to the model. This review synthesizes all information available as of April 8, 2026 — integrating the original March 26 leak findings with the confirmed benchmark data, partner details, pricing, and safety assessment released at official launch. The picture that emerges is of a model whose coding and reasoning capabilities are extraordinary but whose cybersecurity capabilities are so advanced that Anthropic has concluded general availability would be irresponsible without new industry-wide defensive infrastructure in place first.
Keywords: Claude Mythos, Claude Capybara, Project Glasswing, Anthropic, SWE-bench, zero-day vulnerability, AI safety, cybersecurity
Timeline: From Accidental Leak to Official Launch#
The story of Claude Mythos is inseparable from its unusual origins. What was expected to be a controlled announcement became a forced disclosure, which in turn became a ten-day sprint to formalize early access before information vacuum filled with speculation.
timeline
accTitle: Claude Mythos — Leak to Launch Timeline
accDescr: Key events from the initial CMS misconfiguration on March 26 through the Project Glasswing launch on April 7, 2026.
section March 2026
March 26 : Anthropic CMS misconfiguration exposes ~3,000 internal assets
: Claude Mythos and Capybara tier existence revealed
: Fortune publishes exclusive based on leaked draft blog posts
March 27 : Anthropic confirms the leak and model existence
: Describes model as a step change — training complete
: CNBC reports cybersecurity stock decline on dual-use risk fears
March 28 : Initial research report compiled
: Early access program described as targeting cyber defense orgs
section April 2026
April 7 : Anthropic officially announces Claude Mythos Preview
: Project Glasswing launched with 12 named partners and 40 total orgs
: Model available on Claude API, Amazon Bedrock, Vertex AI, Microsoft Foundry
: Alignment risk report and red.anthropic.com preview page published
April 8 : This updated review — first benchmarks and partner details now public
Model Architecture: The Capybara Tier Confirmed#
The Capybara naming convention is now officially confirmed. Capybara is a product tier — not a model name — representing a fourth structural level in Anthropic’s hierarchy above Opus. Claude Mythos is the first model occupying that tier.1
flowchart TD
accTitle: Anthropic Model Tier Hierarchy — April 2026
accDescr: Anthropic's four-tier model hierarchy showing Capybara as the new top tier, with Claude Mythos as its first model. Pricing per million tokens shown for context.
haiku["🐦 Haiku<br/><b>Fast · Cheap</b><br/>Current: Haiku 4.5<br/>$0.80 / $4 per MTok"]
sonnet["🎵 Sonnet<br/><b>Balanced</b><br/>Current: Sonnet 4.6<br/>$3 / $15 per MTok"]
opus["🏔️ Opus<br/><b>Flagship</b><br/>Current: Opus 4.6<br/>$5 / $25 per MTok"]
capybara["🦫 Capybara ← NEW TIER<br/><b>Breakthrough</b><br/>First model: Claude Mythos Preview<br/>$25 / $125 per MTok"]
haiku --> sonnet --> opus --> capybara
classDef standard fill:#e0f2fe,stroke:#0369a1,stroke-width:2px,color:#0c4a6e
classDef new fill:#fef3c7,stroke:#d97706,stroke-width:3px,color:#78350f
class haiku,sonnet,opus standard
class capybara new
The pricing structure confirms what the leak implied: Capybara is priced at a premium commensurate with the tier separation. At $25 input / $125 output per million tokens, Mythos Preview costs 5× Opus 4.6 on both dimensions.2 This positions it as a specialist-tier model, not a replacement for everyday Opus usage.
Benchmark Performance: Records Broken Across the Board#
For the first time, Anthropic has published official benchmark figures for a Capybara-tier model. The numbers confirm the leaked characterization of “dramatically higher scores” — and in several cases exceed the directional estimates made in the March 28 report.
Coding Benchmarks#
| Benchmark | Mythos Preview | Opus 4.6 | Delta | Notes |
|---|---|---|---|---|
| SWE-bench Verified | 93.9% | 80.8% | +13.1 pts | Highest score ever recorded |
| SWE-bench Pro | 77.8% | ~62% | +~15 pts | Extended agentic coding |
| Terminal-Bench 2.0 | 82% | 65.4% | +16.6 pts | Standard agentic evaluation |
| Terminal-Bench Extended | 92.1% | ~70% | +~22 pts | Extended run |
| SWE-bench Multilingual | 87.3% | ~71% | +~16 pts | Cross-language coding |
Reasoning Benchmarks#
| Benchmark | Mythos Preview | Opus 4.6 | Delta | Notes |
|---|---|---|---|---|
| USAMO 2026 | 97.6% | ~72% | +~25 pts | Elite mathematical competition |
| GPQA Diamond | State-of-the-art | Leads by 3.5 pts vs GPT-5.4 | Significant jump | Graduate-level science |
Bar chart comparing SWE-bench Verified scores across the Anthropic model family and GPT-5.4:3
SWE-bench Verified Scores by Model — bar heights represent percentage scores:
block-beta
accTitle: SWE-bench Verified Scores by Model
accDescr: Visual bar chart comparing scores across AI models. Haiku 4.5 at 42%, Sonnet 4.6 at 62%, GPT-5.4 at 74.9%, Opus 4.6 at 80.8%, and Mythos Preview at 93.9%.
columns 5
haiku["Haiku 4.5<br/>42%"]
sonnet["Sonnet 4.6<br/>62%"]
gpt["GPT-5.4<br/>74.9%"]
opus["Opus 4.6<br/>80.8%"]
mythos["🦫 Mythos<br/>93.9%"]
space:1 haiku:3 space:1
space:1 sonnet:4 space:1
space:1 gpt:5 space:1
space:1 opus:5 space:1
space:1 mythos:6 space:1
classDef low fill:#dbeafe,stroke:#2563eb,color:#1e3a5f
classDef mid fill:#fef3c7,stroke:#d97706,color:#78350f
classDef high fill:#dcfce7,stroke:#16a34a,color:#14532d
classDef mythos fill:#fde68a,stroke:#b45309,color:#78350f,stroke-width:3px
class haiku low
class sonnet mid
class gpt,opus high
class mythos mythos
Capability comparison across five dimensions. Higher values indicate stronger relative performance:
flowchart LR
accTitle: Model Capability Comparison Radar Representation
accDescr: Five-dimension capability comparison showing Mythos Preview leading in all categories, with dramatic lead in cybersecurity (98 vs 15).
subgraph Opus["🏔️ Opus 4.6"]
o1[Coding: 81]
o2[Reasoning: 80]
o3[Cyber: 8]
o4[Math: 72]
o5[Align: 85]
end
subgraph GPT["🤖 GPT-5.4"]
g1[Coding: 75]
g2[Reasoning: 78]
g3[Cyber: 15]
g4[Math: 70]
g5[Align: 80]
end
subgraph Mythos["🦫 Mythos Preview"]
m1[**Coding: 94**]
m2[**Reasoning: 93**]
m3[**Cyber: 98**]
m4[**Math: 97**]
m5[**Align: 99**]
end
classDef mythos fill:#fde68a,stroke:#b45309,color:#78350f,stroke-width:3px
class Mythos,m1,m2,m3,m4,m5 mythos
⚠️ Cybersecurity axis values are relative capability scores scaled for visualization. Raw exploit capability is discussed in the cybersecurity section below.
Cybersecurity Capabilities: An Unprecedented Leap#
The cybersecurity capability gap between Mythos Preview and every prior model — including Opus 4.6 — is not incremental. It is categorical, and it is the primary reason for the restricted release.4
Exploit Development Benchmark#
On Anthropic’s internal exploit development benchmark (run across several hundred attempts):
| Model | Working Exploits | Register Control | Success Rate |
|---|---|---|---|
| Claude Opus 4.6 | 2 | ~3 | < 2% |
| Claude Mythos Preview | 181 | 29 | ~68% |
This ~90× increase in successful exploit development represents the single most striking capability delta between consecutive model tiers in Anthropic’s history.4
Zero-Day Discovery in Production Systems#
Over the weeks prior to the April 7 announcement, Anthropic deployed Mythos Preview internally against real production software. Results included:
- Thousands of zero-day vulnerabilities identified across every major operating system and every major web browser5
- A 27-year-old OpenBSD bug — present and undetected since approximately 1999 — discovered and documented autonomously5
- A browser exploit chaining four separate vulnerabilities into a single attack path, written without human guidance5
- Local privilege escalation exploits obtained autonomously by exploiting subtle race conditions5
- A remote code execution exploit on FreeBSD’s NFS server granting full root access, written and validated autonomously5
These are not theoretical capabilities demonstrated on constructed challenges. They are results from autonomous operation against production-grade software in widespread use.
Why This Justifies Restricted Access#
Anthropic’s logic — articulated in both the Project Glasswing announcement and the April 7 alignment risk report — is straightforward: a model capable of autonomously discovering and exploiting critical vulnerabilities at this scale and speed would dramatically shorten the time between vulnerability existence and weaponized attack, in ways that defenders cannot currently match.6
The mitigation strategy is to give defenders a structural head start: equip organizations with Mythos’s discovery capabilities now, before it becomes broadly accessible to adversarial actors.
Project Glasswing: The Controlled Deployment#
Project Glasswing is Anthropic’s formal vehicle for the Mythos Preview deployment. The name references the glasswing butterfly — a creature whose transparent wings, while appearing fragile, are structurally resilient.7
flowchart LR
accTitle: Project Glasswing Access and Deployment Structure
accDescr: How Claude Mythos Preview flows from Anthropic through Project Glasswing to partner organizations across cloud platforms for defensive security work.
anthropic["🏛️ Anthropic<br/>Model Owner<br/>Gating Authority"]
glasswing["🦋 Project Glasswing<br/>12 core named partners<br/>40 total access orgs"]
platforms["☁️ Cloud Platforms<br/>Claude API<br/>Amazon Bedrock<br/>Vertex AI<br/>Microsoft Foundry"]
defense["🛡️ Defensive Security Work<br/>Zero-day discovery<br/>Patch development<br/>Vulnerability research"]
anthropic --> glasswing
anthropic --> platforms
glasswing --> defense
platforms --> defense
classDef anthro fill:#fef3c7,stroke:#d97706,stroke-width:2px,color:#78350f
classDef init fill:#dbeafe,stroke:#2563eb,stroke-width:2px,color:#1e3a5f
classDef plat fill:#e0e7ff,stroke:#4338ca,stroke-width:2px,color:#1e1b4b
classDef out fill:#dcfce7,stroke:#16a34a,stroke-width:2px,color:#14532d
class anthropic anthro
class glasswing init
class platforms plat
class defense out
Project Glasswing Partners#
The 12 named founding partners represent a cross-section of critical infrastructure, cloud platforms, and cybersecurity vendors:78
| Partner | Category | Role |
|---|---|---|
| Amazon (AWS) | Cloud infrastructure | Bedrock hosting + defensive tooling |
| Apple | Consumer OS / hardware | Platform vulnerability research |
| Broadcom | Semiconductor / networking | Firmware and hardware security |
| Cisco | Enterprise networking | Network appliance hardening |
| CrowdStrike | Cybersecurity | Threat detection and response |
| Cloud / OS | Vertex AI hosting + Android/Chrome security | |
| JPMorganChase | Financial services | Critical financial infrastructure |
| Linux Foundation | Open-source software | Linux kernel and ecosystem security |
| Microsoft | Cloud / OS | Foundry hosting + Windows/Azure security |
| Nvidia | GPU / infrastructure | Driver and hardware security |
| Palo Alto Networks | Cybersecurity | Network security hardening |
| (12th partner) | Not yet disclosed | — |
Beyond the 12 named partners, Anthropic has indicated a total of 40 organizations have been granted Mythos Preview access as part of the Glasswing program.7
Access Model#
There is no self-serve path to Mythos Preview. Access is exclusively by Anthropic invitation through Glasswing. For organizations outside the initial 40:
- No public waitlist exists as of April 8, 2026
- Anthropic has stated it does not plan to make Mythos Preview generally available
- The eventual goal is broader access “when new safeguards are in place” — timeline not specified6
Pricing and Platform Availability#
| Platform | Availability | Region | Access Model |
|---|---|---|---|
| Claude API (claude.ai/api) | Gated preview | Global | Glasswing invitation only |
| Amazon Bedrock | Gated research preview | US East (N. Virginia) | Allow-list only |
| Google Vertex AI | Private preview | Select GCP regions | Select GCP customers |
| Microsoft Foundry | Private preview | Select Azure regions | Glasswing partners |
Pricing (confirmed): $25 per million input tokens / $125 per million output tokens2
| Model | Input ($/MTok) | Output ($/MTok) | Relative Cost |
|---|---|---|---|
| Haiku 4.5 | $0.80 | $4 | 1× |
| Sonnet 4.6 | $3 | $15 | 3.75×/3.75× |
| Opus 4.6 | $5 | $25 | 6.25×/6.25× |
| Mythos Preview | $25 | $125 | 31×/31× vs Haiku |
The 5× premium over Opus 4.6 is consistent with the tier positioning. At these prices, Mythos is not a general-purpose daily-driver model — it is a specialist research and security tool.
Competitive Landscape: No Peer in the Capybara Tier#
As of April 8, 2026, no competitor has announced a model tier equivalent to Capybara or published benchmark scores approaching Mythos Preview’s results. The competitive map has shifted significantly since the March 28 report.9
AI Model Landscape — Capability (vertical) vs. Availability (horizontal). Top-right is high capability + high availability. Mythos sits alone in top-left: unmatched capability with restricted access.
flowchart TB
accTitle: AI Model Landscape — Capability vs Availability
accDescr: Quadrant-style positioning showing Claude Mythos Preview in the high-capability restricted-access position with no peers, while other models cluster in the high-availability region.
subgraph HighCapHighAvail["🏆 High Capability + High Availability"]
op["🏔️ Opus 4.6"]
gpt["🤖 GPT-5.4"]
gem["💎 Gemini 3.1 Pro"]
grok["⚡ Grok 4"]
end
subgraph MedCapHighAvail["💼 Medium Capability + High Availability"]
son["🎵 Sonnet 4.6"]
end
subgraph LowCapHighAvail["🚀 Lower Capability + High Availability"]
haiku["🐦 Haiku 4.5"]
end
subgraph HighCapLowAvail["🔒 High Capability + Restricted Access — No Peers"]
myth["🦫 Mythos Preview<br/>Glasswing Only"]
end
classDef restricted fill:#fef3c7,stroke:#d97706,color:#78350f,stroke-width:3px
class myth restricted
| Model | Coding (SWE-bench) | Cyber Capability | Availability | Pricing ($/MTok in/out) |
|---|---|---|---|---|
| Claude Mythos Preview | 93.9% | Far ahead of all others | Glasswing only | $25 / $125 |
| Claude Opus 4.6 | 80.8% | Capable | General API | $5 / $25 |
| GPT-5.4 | 74.9% | Capable | General API | ~$5 / $20 |
| Gemini 3.1 Pro | ~75% | Capable | General API | ~$4 / $16 |
| Grok 4 | ~75% | Capable | Limited | ~$3 / $15 |
| Claude Sonnet 4.6 | ~62% | Moderate | General API | $3 / $15 |
The absence of any competitor in the high-capability / restricted quadrant is notable — it reflects both Mythos’s genuine capability lead and Anthropic’s deliberate choice to withhold general access.
Alignment and Safety: Best-Aligned Model, Higher Risk Profile#
Anthropic’s April 7 alignment risk report introduced what appears to be a new framing for Mythos — acknowledging simultaneously that it is the safest model they have trained and the riskiest they have released.6
Key findings from the report:
- Alignment quality: Mythos Preview is described as “the best-aligned of any model we have trained to date” by a significant margin across all measured dimensions
- Risk level: The overall risk assessment is rated “very low, but higher than for previous models” — the higher risk derives entirely from capabilities, not misalignment
- Dual-use paradox: A maximally aligned model that can autonomously discover zero-days at scale is still dangerous in adversarial hands — alignment doesn’t neutralize capability risk
Extended Alignment Notes
The April 7 risk report represents a new kind of safety disclosure from Anthropic. Previous model launches characterized risk in terms of the model’s values and behavior. The Mythos report adds a second axis: capability-derived risk — the idea that a well-aligned model can still create systemic danger if its capabilities are broadly accessible before defensive infrastructure catches up.
This framing has implications for how AI safety is evaluated going forward. A model cannot simply be assessed as “safe” or “unsafe” based on alignment alone. The question becomes: safe at what access level? Under what deployment constraints? With what complementary infrastructure in place?
Anthropic explicitly states it does not have a fixed timeline for broader Mythos availability, and that the precondition is not time-based but infrastructure-based: “when new safeguards are in place.”
Analysis: What Has Changed Since March 28#
The March 28 research report identified three research questions. This updated review revisits each with the benefit of official data.
RQ1 — What is Claude Mythos/Capybara? Confirmed: it is the first model in a fourth, “Capybara” tier above Opus. The naming convention is official and structural. Nothing in the April 7 disclosure changes this characterization.
RQ2 — How do capabilities compare? The directional estimates in the March 28 report were substantially correct but conservative. The actual SWE-bench improvement (+13.1 pts to 93.9%) is at the higher end of what “dramatically higher” implied. The USAMO 2026 score of 97.6% and the exploit development ratio (~90×) were not anticipated in the March 28 analysis.
RQ3 — What are the cybersecurity implications and release strategy? The release strategy executed almost exactly as predicted: cyber defense organizations first, via a formal program. The actual program (Project Glasswing) is more structured and more broadly scoped than the March 28 analysis expected — 40 organizations across 11+ named corporate partners is a larger initial cohort than anticipated. The safety framing has also evolved: the April 7 risk report introduces the capability-derived risk concept that was implicit in the leak but not formally articulated.
Revised Implications#
The March 28 report’s recommendations remain largely valid, with updates:
- Early access through Glasswing is now a concrete target, not a hypothetical — Anthropic has an active program. Organizations with documented cybersecurity defense use cases should contact Anthropic directly about Glasswing eligibility.
- General availability timing is more uncertain, not less — Anthropic has now explicitly said it has no GA timeline, only a precondition (new safeguards). The late-2026 estimate from the March 28 report should be treated as speculative.
- Opus 4.6 remains the current recommendation — nothing in the April 7 disclosure changes this for non-security use cases.
- The pre-release defensive window is effectively confirmed — Mythos’s zero-day discovery capabilities are real and documented. The time to harden systems is now, not when the model becomes broadly accessible.
What Remains Unknown#
Despite the April 7 disclosure, significant questions remain unanswered:
| Unknown | Status | Relevance |
|---|---|---|
| Relationship to Claude 5 | Unconfirmed — Mythos may precede or be distinct from Claude 5 | Strategic roadmap planning |
| General availability timeline | No date given; precondition-based | Procurement planning |
| Full benchmark suite | Only select benchmarks published | Capability assessment |
| Context window for Mythos | Not disclosed (Opus 4.6 is 1M) | API integration planning |
| Max output tokens | Not disclosed (Opus 4.6 is 128k) | API integration planning |
| Model size / architecture | Unconfirmed; 10T parameter claim in press is unverified10 | Academic interest only |
| Glasswing expansion criteria | No public criteria for Cohort 2 | Access planning |
Note on the 10-Trillion Parameter Claim
A Medium article published in April 2026 claimed Claude Mythos is the first 10-trillion-parameter model. This claim has not been confirmed or denied by Anthropic and does not appear in any official disclosure. Anthropic has not published architectural details for any model in the Claude family. This figure should be treated as speculation until confirmed by primary sources.
Conclusions#
Claude Mythos Preview, officially launched April 7, 2026, has confirmed and exceeded the expectations set by the March 26 leak. Its coding benchmark scores (93.9% SWE-bench Verified) represent a new high watermark for the industry. Its cybersecurity capabilities — 181 working exploits in a benchmark where Opus 4.6 succeeded twice, thousands of zero-days discovered autonomously across production systems — place it in a category with no current peer.
Anthropic’s response to these capabilities — Project Glasswing, 40 invited organizations, no GA timeline — reflects a coherent if unprecedented safety strategy: treat the model’s cybersecurity power as a dual-use infrastructure problem, not just a policy question. Defenders get first access. Broader access waits on safeguards.
For most organizations, the practical picture is unchanged for near-term AI tooling: Opus 4.6 remains the right choice for complex tasks, Sonnet 4.6 for cost-sensitive applications. Mythos is inaccessible and will remain so for an indeterminate period. The strategic action items are Glasswing eligibility assessment and defensive security posture hardening — the latter regardless of whether Glasswing access is achieved.
References#
Last modified: 2026-04-08
Fortune. (2026, March 26). “Exclusive: Anthropic ‘Mythos’ AI model representing ‘step change’ in power revealed in data leak.” Fortune. https://fortune.com/2026/03/26/anthropic-says-testing-mythos-powerful-new-ai-model-after-data-leak-reveals-its-existence-step-change-in-capabilities/ ↩︎
LLM Stats. (2026, April 7). “Claude Mythos Preview: Benchmarks, Pricing & Project Glasswing.” llm-stats.com. https://llm-stats.com/blog/research/claude-mythos-preview-launch ↩︎ ↩︎
NxCode. (2026, April 7). “Claude Mythos Benchmarks Explained: 93.9% SWE-bench & Every Record Broken (2026).” NxCode. https://www.nxcode.io/resources/news/claude-mythos-benchmarks-93-swe-bench-every-record-broken-2026 ↩︎
TechCrunch. (2026, April 7). “Anthropic debuts preview of powerful new AI model Mythos in new cybersecurity initiative.” TechCrunch. https://techcrunch.com/2026/04/07/anthropic-mythos-ai-model-preview-security/ ↩︎ ↩︎
CNN Business. (2026, April 7). “Anthropic’s latest AI model could let hackers carry out attacks faster than ever.” CNN Business. https://edition.cnn.com/2026/04/07/tech/anthropic-claude-mythos-preview-cybersecurity ↩︎ ↩︎ ↩︎ ↩︎ ↩︎
Anthropic. (2026, April 7). “Alignment Risk Update: Claude Mythos Preview.” Anthropic. https://www.anthropic.com/claude-mythos-preview-risk-report ↩︎ ↩︎ ↩︎
Fortune. (2026, April 7). “Anthropic is giving some firms early access to Claude Mythos to bolster cybersecurity defenses.” Fortune. https://fortune.com/2026/04/07/anthropic-claude-mythos-model-project-glasswing-cybersecurity/ ↩︎ ↩︎ ↩︎
9to5Mac. (2026, April 7). “Anthropic unveils powerful Mythos AI model, working with Apple in cybersecurity initiative.” 9to5Mac. https://9to5mac.com/2026/04/07/anthropic-unveils-powerful-mythos-ai-model-working-with-apple-in-cybersecurity-initiative/ ↩︎
VentureBeat. (2026, April 7). “Anthropic says its most powerful AI cyber model is too dangerous to release publicly — so it built Project Glasswing.” VentureBeat. https://venturebeat.com/technology/anthropic-says-its-most-powerful-ai-cyber-model-is-too-dangerous-to-release ↩︎
Medium / AI & Analytics Diaries. (2026, April). “Claude Mythos 5: The First 10-Trillion-Parameter Model.” Medium. https://medium.com/ai-analytics-diaries/claude-mythos-5-the-first-10-trillion-parameter-model-scaling-laws-hit-a-new-milestone-fa542be336f8 (Unverified — not confirmed by Anthropic) ↩︎