
Claude Just Solved Session Limits

YouTube · Nate Herk | AI Automation · May 6, 2026
Anthropic announced a partnership with SpaceX to increase compute capacity, enabling the company to double Claude Code's 5-hour rate limits and remove peak-hours throttling for Pro and Max accounts. The partnership provides 300 megawatts of capacity and over 220,000 Nvidia GPUs, allowing significant API rate limit increases for Claude Opus models, including a 16% boost to input token capacity and a 10x increase in output token capacity, to 80,000 tokens per minute. These changes address months of outages and rate-limiting issues that developers encountered while building with Claude Code and the API.

Detailed Analysis

Anthropic's announcement of a compute partnership with SpaceX marks a significant inflection point in the company's infrastructure strategy, directly addressing months of service degradation that had frustrated developers and enterprise users alike. The partnership, unveiled at the inaugural "Code with Claude" 2026 developer conference spanning San Francisco, London, and Tokyo, enables three immediate operational changes: a doubling of Claude Code's 5-hour rate limits across Pro, Max, and Team subscription tiers; the removal of peak-hours throttling that had been quietly degrading performance during weekday mornings; and a dramatic expansion of API rate limits for Claude Opus models, most notably a tenfold increase in output tokens per minute from 8,000 to 80,000. The scale of these changes reflects not incremental tuning but a structural solution to a capacity crisis that had forced Anthropic into a series of stopgap measures, including temporarily barring new Pro subscribers from accessing Claude Code and restricting third-party tool integrations like OpenClaw and Hermes Agent under terms-of-service enforcement.

The root cause of the preceding turbulence was straightforward: demand for Claude had grown faster than Anthropic could absorb through organic infrastructure scaling alone. The past quarter saw repeated outages coinciding with major model releases, including Claude Opus and Mythos, as unprecedented user growth pressed against a fixed compute ceiling. The company's response (throttling peak hours, restricting plan access, and discouraging high-consumption third-party agent use) was rational short-term triage, but it created friction that threatened developer trust and adoption velocity. The SpaceX deal, alongside a parallel wave of investment deals involving Amazon, Google, Broadcom, Microsoft, Nvidia, Fluidstack, and a Goldman Sachs and Blackstone joint venture, signals that Anthropic has moved from reactive capacity management to proactive infrastructure accumulation across both cloud and physical compute layers.

The broader strategic picture reveals an Anthropic that is explicitly positioning itself for enterprise-scale deployment. Three moves point the same way: the Goldman Sachs and Blackstone financial partnerships announced the day before the developer conference, the simultaneous expansion of its events to international venues, and the launch of managed agent capabilities featuring webhooks, multi-agent orchestration, and auto-scaling functions. Together they indicate a company building the operational substrate necessary to compete for large institutional contracts. Enterprise clients require guarantees of uptime, throughput, and scalability that Anthropic could not credibly offer during the outage-heavy prior quarter. The compute diversification strategy, which spreads dependency across cloud hyperscalers, chip manufacturers, and now aerospace infrastructure, also reduces single-point-of-failure risk in a competitive environment where GPU supply constraints remain a persistent industry-wide challenge.

The partnership with SpaceX in particular carries both symbolic and practical weight in the current AI infrastructure race. While Anthropic has historically positioned itself as safety-focused and institutionally cautious, drawing on SpaceX's compute assets reflects a pragmatic acknowledgment that frontier AI development is inseparable from large-scale physical infrastructure investment. The tenfold jump in API output token throughput is especially significant for agentic and enterprise use cases, where sustained, high-volume model interaction is the norm rather than the exception. As competitors including OpenAI and Google DeepMind continue to expand their own infrastructure footprints, Anthropic's multi-partner compute strategy suggests the company understands that model quality alone is insufficient differentiation: reliable, high-throughput access at scale is now an equally critical competitive dimension.
