You weren't imagining it — Claude Code really did get worse, and Anthropic just explained why - XDA

You weren't imagining it — Claude Code really did get worse, and Anthropic just explained why XDA [truncated: Google News RSS provides only a snippet, not full article

Detailed Analysis

Anthropic publicly acknowledged in an April 23, 2026 engineering blog post that Claude Code, its agentic coding tool, did in fact experience a measurable performance decline — validating widespread user complaints that had been building for weeks. The company identified three distinct engineering changes as the culprits. First, a March 4 update reduced the default reasoning effort level from "high" to "medium" in an attempt to cut latency and address UI freezing issues, a tradeoff that users rejected in favor of higher intelligence defaults; this change was reverted on April 7. Second, a context management bug caused Claude to lose memory of its own reasoning during stale sessions, producing symptoms including forgetfulness, repetitive behavior, unexpected tool choices, cache misses, and accelerated exhaustion of usage limits — a defect that evaded standard code reviews and testing pipelines. Third, an April 16 prompt adjustment designed to reduce verbosity interacted poorly with the other changes and degraded coding quality in ways that were not immediately apparent. Anthropic confirmed fixes were deployed by April 20 in version 2.1.116.

The episode is particularly notable for the gap between user experience and Anthropic's initial response. Before the postmortem, the company reportedly suggested that complaints stemmed from user error or that the observed changes were actually beneficial, a position that generated significant backlash and prompted subscription cancellations among paying users. Anthropic has been careful to draw a distinction between the Claude Code product layer — where the three engineering changes introduced genuine degradation — and the underlying AI models themselves, maintaining that Sonnet 4.6, Opus 4.6, and the API remained unaffected throughout the period. That distinction matters for enterprise trust, but it did little to reassure users who experienced the tool as noticeably worse in daily use, regardless of which layer was responsible.

The broader context surrounding the incident reflects the operational pressures Anthropic faces as Claude Code scales rapidly. The company has simultaneously been managing service outages tied to surging usage, peak-hour rate caps, and a limited rollout of its Mythos model restricted to large enterprise clients. These pressures appear to have contributed to the engineering tradeoffs that produced the March 4 reasoning-effort reduction in the first place — an attempt to improve perceived responsiveness that backfired by sacrificing the output quality users valued most. The context management bug, which evaded internal testing, points to the particular difficulty of catching emergent, session-dependent failures in agentic systems where behavior depends heavily on accumulated state across long interactions.

The incident carries implications for how AI companies communicate with users about product-layer changes versus model-layer changes, a distinction that is technically meaningful but often invisible to end users. Claude Code users paying for premium subscriptions reasonably expect that degradation — whatever its origin — will be acknowledged promptly and transparently. The delay between the onset of complaints and the publication of the postmortem, combined with the initial defensive posture, illustrates a recurring tension in the AI industry: the impulse to protect model reputation can conflict with straightforward accountability for the full stack of software that users actually interact with. Anthropic's eventual detailed postmortem, which laid out root causes, timelines, and remediation steps, represents a more constructive model for handling such incidents, and the company indicated it was also improving internal feedback processes to catch similar issues earlier.

Read original article →

Detailed Analysis

Don't Miss a Deploy