I built a desktop widget to see your Claude session + weekly usage without opening claude.ai

A developer created a desktop widget that displays Claude API usage statistics in an always-on-top overlay, showing real-time session and weekly usage percentages, usage forecasts, per-model cost breakdowns, cache savings, and active sessions without requiring navigation to claude.ai. The tool offers multiple color themes, supports webhook alerts for Slack and Discord, and reads authentication from existing Claude Code OAuth tokens. Installation is available via pip with no additional dependencies, and the software is open-source under MIT licensing with 218 tests and zero telemetry.

Detailed Analysis

A community developer has released `claude-usage-widget`, an open-source desktop overlay tool designed to surface Claude's session and weekly usage statistics as a persistent, always-on-top display without requiring users to manually navigate to the claude.ai settings page. Installable via a single `pip install claude-usage-widget` command on Linux, macOS, and Windows, the widget reads usage data directly from Claude's OAuth token stored in `~/.claude` — the same credential used by Claude Code — eliminating the need for a separate login step. The tool surfaces rate-limit header data that Claude.ai itself uses to display usage figures, ensuring parity with the official interface. Beyond raw percentage counters, the widget includes a usage forecast engine, a 5-hour sparkline, a 90-day heatmap, per-model cost breakdowns across input, output, cache read, and cache write dimensions, active session tracking for Claude Code projects, and webhook integrations for Slack and Discord threshold alerts. Multiple visual themes — including Catppuccin, Dracula, Nord, and Gruvbox — are supported. The project is MIT-licensed and ships with 218 tests and an explicit zero-telemetry policy.

The motivation behind the tool reflects a friction point that has emerged as Claude's usage-limit system has become more consequential for power users, particularly those relying on Claude Code for sustained development workflows. Anthropic's rate-limiting architecture operates on both session-level and weekly-level quotas, and for users running intensive coding or analysis sessions, the gap between hitting a limit and anticipating one can meaningfully disrupt productivity. The widget's forecast feature — projecting time-to-limit based on current consumption rate — addresses exactly this anticipatory gap, shifting users from reactive to proactive limit management. The cost breakdown panel, which performs the dollar-math on cache read and cache write separately, also speaks to a user base sophisticated enough to be optimizing prompt-caching strategies for cost efficiency.

The technical approach of reading directly from API rate-limit response headers is notable because it bypasses any unofficial scraping of the claude.ai frontend and instead relies on the same structured signals the official interface consumes. This gives the widget a degree of durability against frontend UI changes while keeping it tightly coupled to the actual quota enforcement layer. The dependency on the Claude Code credential file (`~/.claude`) means the tool's user base is effectively scoped to Claude Code users, a segment of Anthropic's customer base that skews toward developers and high-frequency API consumers — precisely the cohort most likely to benefit from granular usage visibility and webhook-based alerting systems.

This release fits within a broader pattern of the developer community building observability and cost-management tooling around frontier AI systems faster than the companies themselves ship native solutions. Anthropic currently offers no official desktop widget for usage monitoring; its Android widget is limited to chat initiation shortcuts, and Claude Code's desktop application focuses on session and file management rather than consumption analytics. Third-party tools filling this gap — especially when they expose cost data tied to caching behavior — signal that the market of developers treating Claude as production infrastructure has matured to the point of demanding operational dashboards. The emergence of such tooling also implicitly pressures AI providers to consider whether richer native usage telemetry should be a first-party offering, particularly as multi-model, multi-session workflows make limit tracking more cognitively demanding for end users.

Read original article →

Detailed Analysis

Don't Miss a Deploy