← Reddit

I built an app that preserve your Claude, Codex, and Cursor sessions as high-value data assets

Reddit · homo_sapiens_reddit · May 8, 2026
A developer created DataMoat, an application that preserves, encrypts, and enables searchable access to work transcripts and attachments from AI agents including Claude, Codex, and Cursor. The app stores all vault data locally on the user's machine using AES-256-GCM encryption with password protection, optional TOTP authentication, and BIP39 recovery phrases. Packaged versions are available for macOS with Secure Enclave integration, along with Linux and Windows builds.

Detailed Analysis

A developer operating under the GitHub handle max-ng has released DataMoat, an open-source application designed to capture, encrypt, and make searchable the full work traces generated during sessions with AI coding and reasoning agents such as Anthropic's Claude CLI, OpenAI's Codex CLI, and the Cursor IDE. The tool targets a gap that has emerged as professionals increasingly delegate complex, multi-step tasks to AI agents: the outputs and reasoning chains produced during those sessions are typically ephemeral, stored in fragmented proprietary formats, or silently discarded after the session closes. DataMoat positions itself as a local-first vault that normalizes and indexes these traces for long-term reuse, treating them as first-class data assets rather than throwaway logs.

The application's technical architecture reflects a deliberate privacy-first philosophy. All transcript data, attachments, and session state are stored in an AES-256-GCM encrypted local vault, with vault keys remaining exclusively on the user's machine. There is no cloud upload or server-side storage component, which directly addresses concerns about enterprise confidentiality and personal data sovereignty. The security model is notably layered, combining password-based unlock with an scrypt verifier, optional TOTP two-factor authentication, a 24-word BIP39 recovery phrase, and — on supported Apple hardware — Secure Enclave-backed unlock with Touch ID. The inclusion of BIP39 recovery phrases, a standard borrowed from cryptocurrency wallet infrastructure, signals an audience that is fluent in self-custody security paradigms and skeptical of vendor-controlled data storage.

The scope of supported sources is broad and technically specific, encompassing Claude CLI, Codex CLI and local app sessions, Claude Desktop local-agent sessions on macOS, OpenClaw, and Cursor agent transcripts. Critically, the application also captures locally written thinking and reasoning blocks when the source tool stores them to disk — a feature of particular value to users of models like Claude 3.7 Sonnet and OpenAI's o-series that expose extended chain-of-thought reasoning. These reasoning traces represent among the most analytically rich outputs any AI session produces, and their preservation opens pathways to auditing decision logic, refining prompting strategies, and transferring context across sessions or team members without re-running expensive inference.

DataMoat's emergence reflects a broader and accelerating tension in the AI tooling ecosystem between platform lock-in and user data autonomy. As AI coding agents move from novelty to core professional infrastructure, the session histories they generate begin to constitute genuine institutional knowledge — encoding problem-solving approaches, codebase-specific conventions, and iterative refinements that have real organizational value. Current tooling from Anthropic, OpenAI, and others does not provide robust, portable, user-controlled mechanisms for preserving this knowledge layer. DataMoat is an early attempt to fill that gap at the application layer, outside the control of any AI vendor. The phrase "data moat" in the project name is deliberately borrowed from competitive strategy terminology, suggesting that accumulated, structured AI session history could itself become a durable competitive advantage for individuals and organizations that systematically preserve it.

The project's current limitations — Windows builds remain unsigned, and Linux users must install from source — indicate it is in early but functional stages of development. Its open-source distribution via GitHub and appeal for community stars rather than immediate monetization suggests the developer is prioritizing ecosystem adoption and validation before commercial expansion. If the pattern of AI agent usage continues to intensify across software development, research, and knowledge work, tools like DataMoat point toward an emerging category of infrastructure: personal and organizational AI data management systems that operate independently of the model providers themselves, giving users genuine ownership over the intellectual artifacts that AI-assisted work produces.

Article image Read original article →