Hosting the Agent SDK - Claude Code Docs

The Claude Agent SDK operates as a long-running process that maintains conversational state and executes commands in a persistent environment, distinguishing it from traditional stateless LLM APIs. Deployment requires container-based sandboxing with specified system requirements including Python 3.10+ or Node.js 18+, 1GiB RAM, 5GiB disk space, and 1 CPU. Four production deployment patterns are available—ephemeral sessions, long-running sessions, hybrid sessions, and single containers—with multiple sandbox providers including Modal, Cloudflare, Daytona, E2B, Fly, and Vercel offering hosting options.

Detailed Analysis

Anthropic's Claude Agent SDK represents a meaningful architectural departure from conventional large language model APIs, introducing stateful, long-running agent processes that require purpose-built hosting infrastructure rather than the stateless HTTP request patterns most developers are accustomed to. The SDK, available in both Python (requiring 3.10+) and TypeScript (requiring Node.js 18+), bundles a native Claude Code binary directly within each package, eliminating the need for separate CLI installations and reducing deployment friction. Minimum resource recommendations of 1 GiB RAM, 5 GiB disk, and 1 CPU per instance reflect the heavier computational footprint of persistent agent processes compared to transient API calls, and outbound HTTPS connectivity to api.anthropic.com remains a non-negotiable network requirement. The SDK additionally supports routing through third-party cloud providers including Amazon Bedrock, Google Vertex AI, and Microsoft Azure, giving enterprises flexibility in how they authenticate and route model traffic in compliance with existing cloud agreements.

The hosting documentation outlines four distinct deployment patterns that correspond to different agent lifecycle and use-case requirements. Ephemeral sessions, which spin up a container per task and tear it down upon completion, suit discrete workloads such as bug investigation, invoice processing, or document translation. Long-running sessions maintain persistent containers and can host multiple Claude Agent processes simultaneously, making them appropriate for proactive agents like email triage systems or high-frequency chatbots that must respond continuously without user prompting. Hybrid sessions introduce a middle architecture where ephemeral containers are hydrated with historical state from a database or the SDK's session resumption features, enabling intermittent-use cases like personal project managers or multi-hour deep research tasks to persist context across container lifecycles. The fourth pattern — co-locating multiple agent processes in a single container — is acknowledged as the least common approach due to coordination overhead, though it serves specialized applications like multi-agent simulations.

The security posture mandated by the documentation centers on container-based sandboxing as a first-class requirement rather than an optional enhancement, signaling that Anthropic considers process isolation and ephemeral filesystems foundational to safe agent deployment. The SDK's support for programmatic sandbox configuration allows developers to enforce command allowlists and path restrictions at runtime, as demonstrated in available Python examples that gate filesystem access and restrict executable commands. Several managed sandbox providers — including Modal, Cloudflare Sandboxes, E2B, and Fly Machines — are explicitly cited as compatible hosting environments, reflecting a broader ecosystem of infrastructure tooling that has matured around AI code execution. The FAQ's note that container costs begin at roughly five cents per hour, and that token costs remain the dominant expense, positions agentic deployments squarely within the economics of sustained model inference rather than lightweight API usage.

This documentation arrives at a moment when the industry is rapidly formalizing what it means to deploy AI agents reliably in production, a challenge that differs substantially from deploying conventional software services. The four-pattern taxonomy Anthropic presents mirrors similar architectural frameworks emerging from competing platforms, suggesting that the field is converging on a shared vocabulary of ephemeral, persistent, and hybrid agent lifecycles. The explicit guidance around idle container timeouts, the `maxTurns` property to prevent infinite loops, and health monitoring through standard backend logging infrastructure reflects hard-won operational lessons being codified into public documentation. For enterprises evaluating agentic platforms, the SDK's multi-cloud authentication support and the availability of a growing roster of compatible sandbox providers substantially lower the barrier to integrating Claude-based agents into existing infrastructure stacks, while the container-isolation requirement draws a clear boundary around acceptable deployment configurations from a security standpoint.

Read original article →

Detailed Analysis

Don't Miss a Deploy