Why have they removed Adaptive thinking from Sonnet 4.6?

Adaptive thinking was removed from Sonnet 4.6, a feature that previously allowed the model to respond faster for non-complex tasks. Without this adaptive functionality, users face a dilemma: enabling thinking causes unnecessary computation for simple tasks, while disabling it produces subpar responses. This limitation forces pro plan users to keep thinking constantly enabled, which causes them to hit rate limits more rapidly.

Detailed Analysis

A Reddit user posting to r/Anthropic has raised concerns about the apparent removal of "adaptive thinking" from Claude Sonnet 4.6, a feature that previously allowed the model to dynamically determine whether extended chain-of-thought reasoning was necessary for a given task. The post describes a frustrating binary situation: with thinking disabled, response quality degrades noticeably, but with thinking enabled, the model applies intensive reasoning even to simple queries that don't warrant it. This forces the user into a default-on thinking configuration that accelerates consumption of token allocations under Anthropic's Pro subscription plan, leading to premature rate limiting.

The feature the user is mourning — adaptive thinking — represented a meaningful design philosophy in which the model itself served as an intelligent arbiter of computational resource allocation. Rather than treating all prompts as equally demanding, adaptive thinking allowed Claude to match cognitive overhead to task complexity, producing faster and more efficient responses for straightforward queries while reserving deeper reasoning for genuinely complex problems. Its removal in favor of a binary on/off toggle represents a step toward less nuanced user control and potentially higher operational costs for subscribers who rely heavily on the model's reasoning capabilities.

This complaint reflects a broader tension in the development of large language models with extended reasoning features: how to expose powerful but computationally expensive capabilities to users in a way that is both effective and economically sustainable. Extended thinking, as implemented in Claude 3.7 Sonnet and carried forward into the Sonnet 4 series, was a significant differentiator for Anthropic, enabling the model to outperform on multi-step reasoning benchmarks. However, always-on thinking substantially increases token usage — both input and output — which has direct implications for rate limits, inference costs, and API pricing tiers.

The absence of adaptive thinking in Sonnet 4.6 may reflect deliberate product decisions around model architecture, pricing structure, or the difficulty of reliably training a model to self-assess task complexity without introducing inconsistent behavior. Alternatively, it could be a temporary regression as Anthropic iterates on the Sonnet 4.x line. The user's feedback is indicative of a vocal segment of the Claude user base that has grown accustomed to the efficiency gains of adaptive reasoning and views its removal as a functional downgrade, even if the underlying model capabilities in other respects have improved.

The post touches on a recurring theme in AI product development: capability additions do not always translate smoothly across model versions, and features that users come to depend on can disappear without clear public explanation. Anthropic's communication around feature deprecations or architectural changes in its model series has been a recurring point of friction in community forums. As competition intensifies among frontier model providers — with OpenAI, Google, and others all advancing their own reasoning-capable models — user experience decisions like adaptive thinking become meaningful differentiators that influence retention among power users on paid tiers.

Read original article →

Detailed Analysis

Don't Miss a Deploy