Detailed Analysis
Anthropic adjusted Claude's usage limits in March 2026, introducing a time-sensitive throttling mechanism that has left users experiencing what feels like a reduction in available usage — even though the company's weekly caps remain technically unchanged. The change centers on a rolling 5-hour window tied to peak demand hours (05:00–11:00 PT or 13:00–19:00 GMT), during which users across all plan tiers — Free, Pro ($20/month), Max 5x ($100/month), and Max 20x ($200/month) — encounter stricter session limits than they would during off-peak periods. The Reddit user's complaint reflects a genuine shift in user experience: tasks involving multiple file uploads and conversational follow-ups, which previously never triggered a cap, now exhaust the rolling window far more quickly under this new architecture.
The underlying driver of the change is infrastructure demand management. As Claude's user base has grown substantially since early 2025, Anthropic faces the challenge of maintaining service quality during high-traffic windows without dramatically expanding compute in ways that could destabilize pricing. The rolling 5-hour window system is a dynamic throttle rather than a hard weekly ceiling, meaning that the same workflow can succeed or fail depending entirely on when it is executed. This design choice prioritizes aggregate system stability over individual user consistency — a trade-off that explains why longtime users report a sudden degradation in what had been reliable daily workflows.
The technical reality behind this limit sensitivity lies in how large language models handle context. Reports indicate that in long, file-heavy sessions, approximately 98.5% of tokens consumed can be attributed to re-processing the growing conversation history rather than generating new output. When a user uploads three files and asks sequential questions, the model is reprocessing the entire file content and prior exchange with each message. This makes file-centric workflows disproportionately expensive under a rolling session-window model, amplifying the perceived impact of the March 2026 adjustment even for users on paid tiers.
The shift connects to a broader pattern in the commercial AI industry, where providers are increasingly moving away from simple monthly token allowances toward more sophisticated usage shaping mechanisms that blend time-of-day pricing logic with rolling windows and tiered access. OpenAI, Google, and now Anthropic have all introduced rate-limiting architectures that distribute load across time rather than simply capping volume. For Anthropic specifically, this development reflects the tension between its positioning as a safety-focused, research-driven organization and the commercial reality of operating a high-demand consumer product at scale. The introduction of a purchasable overage system alongside the rolling cap signals that Anthropic is also actively exploring monetization mechanisms beyond flat-rate subscriptions, a model that positions heavy users toward the higher Max tiers or pay-as-you-go extensions as a matter of practical necessity rather than choice.
Read original article →