← Google News

Anthropic launches Project Glasswing, an effort to prevent AI cyberattacks with AI - Engadget

Google News · April 7, 2026
Anthropic launches Project Glasswing, an effort to prevent AI cyberattacks with AI Engadget [truncated: Google News RSS provides only a snippet, not full article

Detailed Analysis

Anthropic launched Project Glasswing on April 10, 2026, a major collaborative cybersecurity initiative that deploys its unreleased Claude Mythos Preview model to proactively identify, exploit, and patch vulnerabilities in critical software infrastructure before malicious actors can leverage them. The project brings together an unusually broad coalition of technology and financial heavyweights — including Amazon Web Services, Apple, Broadcom, Cisco, CrowdStrike, Google, JPMorgan Chase, the Linux Foundation, Microsoft, NVIDIA, and Palo Alto Networks — while extending access to more than 40 additional organizations responsible for maintaining critical open-source software. Anthropic is backing the initiative with up to $100 million in usage credits for Mythos Preview access across platforms including Claude API, Amazon Bedrock, Google Vertex AI, and Microsoft Foundry, and has pledged an additional $4 million in donations to open-source security organizations, signaling a substantial financial commitment to treating cybersecurity as a shared infrastructure problem rather than a competitive differentiator.

The technical capabilities underlying Project Glasswing are striking. Claude Mythos Preview, a general-purpose frontier model with particular strengths in coding and agentic tasks, has already discovered thousands of high-severity zero-day vulnerabilities spanning every major operating system and web browser, including flaws that had gone undetected for decades. In controlled evaluations, the model generated working exploits in 181 out of several hundred attempts — a performance that dwarfs its predecessor Claude Opus 4.6, which succeeded in nearly zero percent of comparable attempts — while also proposing patches for the vulnerabilities it uncovered. The dual-use risk inherent in this capability is not lost on Anthropic, which has implemented gated access protocols to limit exposure to the model's exploit-generation abilities, restricting usage to vetted participants scanning their own first-party and open-source systems.

The initiative represents a direct response to a cybersecurity inflection point: AI systems are now surpassing human capabilities in vulnerability detection, meaning the same technologies that defenders can use to harden infrastructure are available — or soon will be — to sophisticated attackers. Anthropic's framing of Project Glasswing as a race to patch before adversaries exploit reflects a calculated bet that defensive deployment of frontier AI, done transparently and collaboratively, can outpace the offensive applications. The emphasis on sharing industry learnings and establishing AI-augmented security standards suggests Anthropic is also attempting to shape emerging norms around how powerful AI models should be responsibly integrated into security workflows, a space where no clear industry consensus yet exists.

Zooming out, Project Glasswing fits into a broader and accelerating trend of technology companies repositioning AI from a productivity enhancer to critical infrastructure protection. The scale and composition of the coalition — spanning cloud hyperscalers, endpoint security firms, financial institutions, and open-source foundations — reflects a recognition that no single organization can secure the interconnected software supply chain alone, especially as AI-enabled threats grow more sophisticated. Security experts have noted that AI's growing role in SaaS ecosystems and third-party integrations dramatically expands attack surfaces, making the collaborative, proactive posture Anthropic is advocating increasingly necessary rather than merely aspirational.

The launch of Project Glasswing also raises substantive questions about governance and accountability that the initiative has yet to fully resolve. Anthropic's decision to grant access to a model capable of generating working exploits — even under carefully gated conditions — places enormous trust in participating organizations' security cultures and internal controls. The dual-use tension at the core of the project is not a theoretical risk: a model that can discover and exploit previously unknown vulnerabilities in every major operating system represents a genuinely novel threat surface if access controls fail or are circumvented. How Anthropic monitors usage, responds to misuse incidents, and evolves its access policies as Mythos Preview's capabilities become better understood will be as consequential as the vulnerabilities the project manages to patch.

Read original article →