Be like Claude. 1. Release Opus 4.6 to users. 2. Users enjoy it. 3. For two months, you degrade Opus 4.6. 4. Restore the original Opus 4.6 and relabel it Opus 4.7. 5. Users rejoice. That's the best business model out there... today they're releasing the old 4.6 opus (AKA: 4.7)

A critical post alleges that Anthropic released Opus 4.6, degraded it for two months, then restored the original version and relabeled it as Opus 4.7 to create the perception of product improvement among users. The author claims this represents a deceptive business model where users celebrate receiving what is essentially the same product marketed as an upgrade.

Detailed Analysis

Anthropic released Claude Opus 4.7 on April 16, 2026, to widespread user enthusiasm, though the launch arrived against a backdrop of sustained community frustration with its predecessor, Opus 4.6. The Reddit post in question captures a popular sentiment that had been circulating in AI developer communities: that Opus 4.6, released in February 2026, had undergone a quiet but meaningful degradation in quality over its roughly two-and-a-half months of deployment, and that Opus 4.7 effectively restores what users originally valued. While Anthropic's official framing describes 4.7 as a genuine technical advancement — delivering 13% higher resolution on a 93-task coding benchmark, a jump from 53.4% to 64.3% on SWE-bench Pro, and improvements to vision, instruction-following, and long-horizon reasoning — the user base's reaction reveals a trust deficit that the company must now contend with alongside the technical rollout.

The "degrade and relabel" narrative, though not confirmed by official sources, reflects a broader pattern of user skepticism toward AI labs regarding post-launch model behavior. Opus 4.6's regressions were specific and publicly documented: an AMD director's GitHub post singled out unreliability in complex engineering workflows, and pre-release speculation about 4.7 centered heavily on whether it would simply restore original 4.6 capabilities rather than represent a step-change improvement. The fact that 4.7 is priced identically to 4.6 at $5/$25 per million tokens, and is directly replacing 4.6 in model pickers across platforms like GitHub Copilot, adds credibility to the interpretation that this release is, at minimum, partly corrective in nature — whatever the underlying technical reality of the model weights.

The broader competitive context makes the timing of this release strategically significant. Opus 4.7 outperforms both GPT-5.4 (57.7% on SWE-bench Pro) and Gemini 3.1 Pro (54.2%) on several key agentic coding benchmarks, positioning Anthropic favorably in the enterprise and developer tool markets where coding assistants have become the primary battleground. The introduction of a new "xhigh" effort tier — slotted between "high" and "max" — and its adoption as the new default in Claude Code signals that Anthropic is deliberately tuning the model's behavior for professional deployment contexts where reliability and latency trade-offs are paramount concerns.

Perhaps most notable is Anthropic's disclosure that Opus 4.7 itself is not the company's most capable model. The unreleased Mythos model reportedly surpasses 4.7, but has been held back for safety evaluation reasons. This admission is strategically double-edged: it reassures safety-conscious observers that Anthropic is maintaining its frontier governance commitments, but it also implicitly acknowledges that commercial release decisions are constrained by a separate internal capability track. The gap between what Anthropic can build and what it chooses to deploy has become a defining feature of its public identity, distinguishing it from competitors who tend to foreground capability maximalism in their release narratives.

The user frustration captured in the Reddit post ultimately points to a structural challenge facing all major AI labs: the opacity of post-launch model changes. Whether Opus 4.6's degradation was deliberate, accidental, or a product of infrastructure changes, the perception of manipulation is damaging in a market where developer trust is a primary switching cost. Anthropic's ability to recover that trust with 4.7 — rather than simply neutralize the criticism — will depend on whether the performance improvements prove durable over time, or whether the cycle of regression and restoration repeats. The enthusiastic early reception is a positive signal, but the community's memory of the 4.6 episode suggests it will be watching closely.

Read original article →

Detailed Analysis

Don't Miss a Deploy