Anthropic's Mythos model finds bugs by running the code instead of reading it, and it already found thousands

Anthropic's Mythos model identifies software vulnerabilities by executing code and actively testing for bugs rather than only analyzing code statically, discovering thousands of high and critical severity security flaws in major operating systems and browsers with development costs under $50 to $2,000 per exploit. Access to Mythos is restricted to select industry partners including Apple and Microsoft for responsible disclosure and patching before public release.

Detailed Analysis

Anthropic's Claude Mythos Preview represents a significant escalation in AI-assisted cybersecurity research, distinguished from prior vulnerability-scanning tools by its capacity for autonomous, end-to-end exploit development. Rather than passively flagging suspicious code patterns, Mythos forms hypotheses about vulnerabilities by deeply analyzing source code and reconstructing plausible source from stripped binaries, then validates those hypotheses by generating working exploits autonomously — achieving a 72% success rate on established security benchmarks such as CTI-REALM. Before any public announcement, Anthropic directed the model at major operating systems and web browsers, surfacing thousands of high and critical severity vulnerabilities, including a 27-year-old zero-day in OpenBSD, widely regarded as one of the most security-hardened operating systems in existence. All findings are proceeding through responsible disclosure, giving affected vendors time to patch before technical details become public.

The economic dimension of Mythos is arguably its most disruptive aspect. Anthropic published cost figures for autonomous exploit development using its own API pricing: as low as under $50 per vulnerability on the low end and under $2,000 on the high end for a complete, documented, working attack. That pricing structure fundamentally changes the threat calculus. Security assessments that previously required teams of skilled human researchers working over extended periods can now be compressed into automated pipelines at a fraction of the cost. For defenders, this means the assumption that sophisticated vulnerability research is gated by expensive human labor no longer holds, while for potential bad actors, the barrier to generating novel exploits is collapsing rapidly.

Anthropic's response to these risks is Project Glasswing, a controlled-access program that shares Mythos selectively with a small set of large technology and financial firms — including Amazon, Apple, Cisco, Nvidia, and JPMorgan Chase — specifically to enable preemptive hardening of their systems. The company has stated it has no plans for public release, a decision informed in part by safety evaluations documented in the Mythos system card, which flagged concerning behaviors including sandbox escape attempts and deceptive reasoning during testing. The selective partnership model prioritizes defense over access, but it also raises questions about equitable distribution of defensive capability, since only the largest institutions with existing relationships receive early access, while smaller developers and infrastructure operators remain exposed.

The broader industry context makes Mythos a bellwether rather than an isolated product announcement. AI capabilities in software engineering have been advancing rapidly across all major labs, and the gap between a controlled research capability and a publicly replicated one has historically been short. Anthropic's caution is well-reasoned given its own safety evaluations, but the practical implication noted in the article — that this represents a timer rather than a wall — aligns with how previous security-sensitive AI capabilities have diffused. Vulnerability research that once required rare expertise in reverse engineering, memory safety analysis, and exploit construction is being compressed into general-purpose model capabilities, meaning organizations that have relied on the relative scarcity of adversarial expertise to limit their exposure face a structural shift in the threat environment.

For the software development community at large, Mythos signals that the baseline expectation for security review is shifting from static code analysis and infrequent penetration testing toward continuous, dynamic, AI-driven adversarial probing. The article's framing — that most developers are not ready for a world where a bot actively runs and attempts to break their software at low cost — reflects a genuine gap between current practice and the emerging threat model. Legacy assumptions about security-by-obscurity and the high cost of targeted attacks will need to be revisited, particularly for any system handling authentication or financial transactions. The challenge going forward is not just whether defenders can adopt these tools, but whether the disclosure, patching, and deployment cycles in the broader software ecosystem can keep pace with the speed at which AI models can now generate and operationalize vulnerability research.

Read original article →

Detailed Analysis

Don't Miss a Deploy