George Hotz dismisses Anthropic's cybersecurity claim about Mythos

Detailed Analysis

George Hotz, the hacker who rose to prominence by jailbreaking the iPhone and PlayStation 3 and who currently serves as president of autonomous driving startup comma.ai, has publicly challenged Anthropic's cybersecurity risk claims surrounding its Claude Mythos model, arguing that the AI company is substantially overstating the threat that its system poses to digital security infrastructure. Hotz's critique operates on two distinct fronts: a philosophical argument about the nature of software vulnerabilities and a practical evidentiary one. On the philosophical side, Hotz contends that zero-day exploits — previously unknown software vulnerabilities highly prized in security communities — are not nearly as rare or difficult to discover as AI labs imply. He argues their scarcity in the wild is largely a function of legal deterrence rather than technical complexity, suggesting that the barrier to finding such vulnerabilities is lower than Anthropic's safety framing would indicate.

The empirical dimension of Hotz's critique carries significant weight. Reports indicate that many of the vulnerabilities Claude Mythos identified were present in outdated, legacy software and were not actually exploitable in real-world conditions — a finding that materially undermines the severity of the threat narrative Anthropic promoted. Despite this, the initial release of Mythos's capabilities reportedly triggered alarm on Wall Street and prompted emergency regulatory discussions, suggesting that Anthropic's framing had outsized influence on public and institutional perception. Compounding the credibility problem, smaller open-source models have since been shown to match or outperform Mythos in identifying critical vulnerabilities, which erodes the argument that frontier-scale models represent a uniquely elevated and novel cybersecurity risk category.

Hotz punctuated his critique with characteristic provocation, sarcastically threatening to "release one zero day a day until a big new model is released" as a rhetorical device to pressure Anthropic and OpenAI into reconsidering what he frames as self-serving safety narratives. This challenge speaks to a broader tension in the AI industry between genuine safety research and what critics increasingly describe as "safety theater" — the practice of amplifying risk claims in ways that simultaneously burnish a company's responsible-AI credentials, influence regulatory frameworks in incumbents' favor, and shape public expectations around model releases. Whether or not one accepts Hotz's provocative framing, his intervention points to a real accountability gap: claims about AI-enabled cybersecurity uplift are difficult to independently verify, and the companies making those claims have structural incentives to present their models as both powerful enough to be impressive and dangerous enough to justify regulatory moats.

The broader context of this dispute sits within an accelerating debate about how AI developers communicate risk to regulators, investors, and the public. Anthropic has positioned itself as a safety-first organization, and its responsible scaling policies and model cards are widely cited as industry benchmarks. However, critics like Hotz argue that this safety-centric identity can create perverse incentives — where overestimating a model's capabilities in dangerous domains serves marketing and regulatory purposes simultaneously. The cybersecurity domain is particularly fraught because threat assessment is highly technical, classified information is asymmetrically distributed, and fear of catastrophic hacking events makes audiences receptive to alarming claims. As more open-source models close the capability gap with proprietary frontier systems, challenges to the premise that only large, closed models pose serious dual-use risks are likely to intensify, putting additional pressure on companies like Anthropic to substantiate their threat assessments with reproducible, independently verifiable evidence.

Read original article →

Detailed Analysis

Don't Miss a Deploy