Claude Code uses 5% just to launch Claude Code

A user reported that launching Claude Code in VS Code now consumes 5% of their session usage limit on initial startup, whereas it previously consumed 1% or less. Similar excessive usage occurred when checking the Opus 4.8 model in the VS Code extension, which consumed 40% of the session without any prompts being sent, leading the user to suspect the model is being invoked for interface navigation operations. The user noted that selecting Haiku instead of Opus significantly reduces this usage pattern, and questioned whether this widespread issue affects other users.

Detailed Analysis

Claude Code, Anthropic's agentic coding assistant, is drawing user complaints about unexplained token consumption that occurs before any prompts are submitted. A user on the r/ClaudeCode subreddit reports that simply launching the Claude Code terminal interface consumes approximately 5% of their session usage, and that checking the usage via the VS Code extension after the release of Opus 4.8 immediately showed 40% consumption — again, with no user-initiated prompts. The user notes this behavior has existed historically at much lower levels (around 1% or less), but the scale of recent consumption has become significant enough to post about. Their system configuration includes Windows 11, PowerShell 7.6.2, VS Code Insiders, and a Claude Pro subscription.

The user's own investigation points to a likely cause: background model inference triggered by interface navigation itself. Running the `/context` command reveals approximately 30,000 tokens already loaded in the session — 14,000 from memory files and 7,000 from skills — even before a single conversational prompt is sent. This suggests Claude Code's architecture pre-loads contextual information and may invoke the underlying model to process or render interface commands like `/usage` and `/release notes`. The user's observation that switching to the lighter Haiku model reduces passive consumption lends credibility to the theory that a language model is being called during routine UI operations, not merely during user-directed tasks.

This issue carries real practical consequences for Pro subscribers, who operate under session-based usage limits rather than unlimited access. If a non-trivial percentage of a session's token budget is consumed by initialization and interface navigation, users effectively receive less productive capacity per session than the subscription implies. The jump from roughly 1% to 5% or even 40% in passive consumption — particularly coinciding with the rollout of the more capable and likely more resource-intensive Opus 4.8 model — suggests that model upgrades may carry hidden infrastructure cost increases that are not communicated to end users.

More broadly, this reflects a growing tension in agentic AI tooling between ambient intelligence and resource transparency. Tools like Claude Code are designed to maintain persistent context, memory, and skills that make them more capable across sessions, but these features inherently consume tokens in the background. As AI coding assistants become more sophisticated — loading project memory, indexing codebases, and maintaining conversational state — the line between "idle" and "active" resource consumption blurs significantly. Anthropic and competing providers building similar agentic tools will face increasing pressure to give users clearer visibility into what consumes their limits and why, particularly as these tools move deeper into professional and enterprise workflows where cost predictability matters.

Read original article →

Detailed Analysis

Don't Miss a Deploy