Claude kept hallucinating my Factorio bottlenecks. So I built an MCP that reads your saves.

Savecraft is an open-source MCP server that addresses Claude's hallucinations about game-specific questions by reading actual game saves and providing real data instead of guesses. Supporting games including Diablo II: Resurrected, Factorio, Path of Exile, Magic: The Gathering Arena, and others, it uses reference modules to supply accurate information such as drop rates, calculations, and recipe ratios, available free at savecraft.gg.

Detailed Analysis

Savecraft, an open-source Model Context Protocol (MCP) server built by an independent developer, directly confronts one of the most persistent failure modes in applying large language models to domain-specific tasks: hallucination caused by stale or absent training data. The tool reads actual game save files and feeds verified, real-time game state data to Claude rather than allowing the model to generate plausible-sounding but fabricated answers. Supported titles span a wide range of complexity, from Diablo II: Resurrected and Path of Exile to Factorio, RimWorld, Stellaris, and Magic: The Gathering Arena. In each case, Savecraft routes the analytical heavy lifting to dedicated reference modules — drop tables, Path of Building calculations, 17Lands draft data, and production ratio engines — so that Claude's role is explanation and synthesis rather than numerical invention. The result is a clean division of labor: computation handled by deterministic systems, communication handled by the language model.

The underlying problem Savecraft addresses is structural rather than incidental. Claude and comparable models hallucinate game-specific answers primarily because training data for live, patch-sensitive games degrades rapidly, and the model has no mechanism to distinguish what it knows from what it is confabulating. This is especially acute in games like Factorio, where production ratios depend on precise recipe values that change with updates, or Path of Exile, where build optimality is a function of dozens of interacting numerical systems. By intercepting the query before the model can speculate and substituting verified inputs, Savecraft effectively removes the condition that causes hallucination. The architecture mirrors a broader principle emerging in applied AI tooling: language models perform most reliably when used as reasoning and communication layers over external computation, not as authoritative sources of domain-specific facts.

The MCP protocol itself is central to why this approach is viable at scale. Anthropic's Model Context Protocol, released in late 2024 and rapidly adopted across the developer ecosystem, establishes a standardized interface for AI models to invoke external tools, read structured data sources, and receive verified outputs within a reasoning chain. Savecraft's implementation fits squarely within this paradigm, and it joins a growing catalog of MCP servers purpose-built for specific domains — including factorioctl, which connects Claude directly to live Factorio server instances via RCON, and the Factorio Learning Environment, an open-source research framework that added MCP tool invocation support in mid-2025 for multi-agent factory analysis. The proliferation of these tools signals a maturing understanding among developers that MCP's value is not general-purpose augmentation but precision grounding in contexts where model knowledge is demonstrably unreliable.

One non-trivial engineering consideration the project surfaces is context window efficiency. Research into MCP deployments has shown that tool definitions and large data outputs can consume 40 percent or more of available context, degrading tool selection and increasing the likelihood of the model ignoring returned data. Savecraft addresses this by having reference modules return compact, pre-processed summaries rather than raw save file contents, keeping the information density high and the token footprint low. This architectural discipline — processing intermediates outside the model and delivering only distilled results — is increasingly recognized as essential for MCP servers operating over complex, data-rich domains, and Savecraft's design reflects that emerging best practice.

The broader significance of Savecraft lies in what it demonstrates about the relationship between AI capability and domain-specific tooling. Rather than waiting for foundation models to internalize sufficiently detailed and current game knowledge — a training data problem with no clean solution — the developer externalized the knowledge entirely and used Claude purely as an interface layer. This pattern, sometimes described as retrieval-augmented generation's more structured cousin, is likely to define a significant portion of practical AI deployment in specialized domains over the next several years. Games represent a particularly visible and tractable instance of the problem, but the same architecture applies wherever factual precision matters and training data is inherently perishable: financial modeling, regulatory compliance, live scientific data, and competitive sports analytics all face structurally identical challenges. Savecraft's open-source release makes it a useful reference implementation for developers working in any of those adjacent spaces.

Read original article →

Detailed Analysis

Don't Miss a Deploy