Tell HN: Claude Opus 4.7 quota suddenly changed to 0 TPM in Bedrock

An AWS user's access to Claude Opus 4.7 on Bedrock was suddenly revoked, with the quota set to 0 TPM starting May 1st, 2026. AWS support attributed the change to a system update that adjusted access controls based on factors including regional considerations, payment history, and usage patterns. The company recommended migrating to Claude Opus 4.6 as an alternative while pursuing an access restoration review.

Detailed Analysis

AWS Bedrock customers using Anthropic's Claude Opus 4.7 began encountering sudden throttling errors on May 1st, 2026, after the model's quota was silently reset to zero tokens per minute without prior notification. The incident, surfaced on Hacker News, reflects a complaint from at least one production user — apparently serving government customers — whose applications were immediately impaired. The AWS support response confirmed that a "recent system update adjusted access controls," citing regional considerations, payment history, and usage patterns as potential factors governing model access, while simultaneously declining to guarantee restoration of the revoked quota.

The AWS support communication reveals a structural tension in how cloud providers manage access to third-party frontier AI models. Unlike conventional cloud compute resources, access to specific AI models on platforms like Bedrock is subject to a layered governance model — one that involves Anthropic's own access policies, AWS's internal controls, and account-level review processes. The acknowledgment that "accessibility is subject to change automatically based on various factors" suggests an automated or semi-automated access management system operating without robust user-facing transparency mechanisms. This stands in stark contrast to how most enterprise cloud customers expect quota management to work: as a stable, contractually bounded service attribute rather than a dynamically revocable permission.

The recommended fallback — Claude Opus 4.6 with ample quota headroom (3 million tokens per minute on cross-region inference) — points to the likelihood that Opus 4.7's limited availability is not a platform-wide failure but rather a controlled or graduated rollout, possibly still under access restrictions tied to demand management, compliance review, or model maturity considerations. The fact that Opus 4.6 is positioned as an "effective replacement with minimal code changes" suggests the two models share a closely compatible API surface, but the very existence of a newer model that users had already integrated into production implies AWS allowed adoption of Opus 4.7 before its access controls were fully stabilized.

This episode connects to a broader pattern in the AI infrastructure market, where the pace of model releases — particularly from Anthropic, OpenAI, and Google — has outstripped the maturity of enterprise-grade availability guarantees. Cloud providers acting as distribution intermediaries for rapidly iterating frontier models face an inherent challenge: they must balance early access enablement, which drives adoption, against the operational and compliance overhead of supporting models that may not yet have fully defined SLA frameworks. For enterprise and government customers especially, the absence of advance notice for quota changes represents a significant reliability gap that could accelerate demand for direct API access through Anthropic's own platform, or for contractual SLA protections that Bedrock has yet to formalize around frontier model availability.

The incident also underscores the geopolitical and regulatory complexity embedded in AI model distribution. AWS's reference to "regional considerations" as a factor in access revocation hints at the possibility that certain models — particularly those at the frontier — may be subject to export controls, compliance reviews, or usage restrictions tied to customer type (including government entities). As AI models become critical infrastructure for enterprise and public-sector applications, the lack of transparent access governance frameworks from cloud intermediaries like AWS represents a growing operational risk that neither customers nor the broader industry have yet fully reckoned with.

Read original article →

Detailed Analysis

Don't Miss a Deploy